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Introduction 



What this Book is About 

Network Calculus is a set of recent developments that provide deep insights into 
flow problems encountered in networking. The foundation of network calculus 
lies in the mathematical theory of dioids, and in particular, the Min-Plus dioid 
(also called Min-Plus algebra). With network calculus, we are able to understand 
some fundamental properties of integrated services networks, window flow control, 
scheduling and buffer or delay dimensioning. 

This book is organized in three parts. Part I (Chapters 1 and 2) is a self con- 
tained, first course on network calculus. It can be used at the undergraduate level 
or as an entry course at the graduate level. The prerequisite is a first undergraduate 
course on linear algebra and one on calculus. Chapter 1 provides the main set of re- 
sults for a first course: arrival curves, service curves and the powerful concatenation 
results are introduced, explained and illustrated. Practical definitions such as leaky 
bucket and generic cell rate algorithms are cast in their appropriate framework, and 
their fundamental properties are derived. The physical properties of shapers are de- 
rived. Chapter 2 shows how the fundamental results of Chapter 1 are applied to 
the Internet. We explain, for example, why the Internet integrated services internet 
can abstract any router by a rate-latency service curve. We also give a theoretical 
foundation to some bounds used for differentiated services. 

Part II contains reference material that is used in various parts of the book. Chap- 
ter 3 contains all first level mathematical background. Concepts such as min-plus 
convolution and sub-additive closure are exposed in a simple way. Part I makes a 
number of references to Chapter 3, but is still self-contained. The role of Chapter 3 is 
to serve as a convenient reference for future use. Chapter 4 gives advanced min-plus 
algebraic results, which concern fixed point equations that are not used in Part I. 

Part III contains advanced material; it is appropriate for a graduate course. Chap- 
ter 5 shows the application of network calculus to the determination of optimal play- 
back delays in guaranteed service networks; it explains how fundamental bounds for 
multimedia streaming can be determined. Chapter 6 considers systems with aggre- 
gate scheduling. While the bulk of network calculus in this book applies to systems 
where schedulers are used to separate flows, there are still some interesting results 
that can be derived for such systems. Chapter 7 goes beyond the service curve deflni- 
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tion of Chapter 1 and analyzes adaptive guarantees, as they are used by the Internet 
differentiated services. Chapter 8 analyzes time varying shapers; it is an extension 
of the fundamental results in Chapter 1 that considers the effect of changes in sys- 
tem parameters due to adaptive methods. An application is to renegotiable reserved 
services. Lastly, Chapter 9 tackles systems with losses. The fundamental result is a 
novel representation of losses in flow systems. This can be used to bound loss or 
congestion probabilities in complex systems. 

Network calculus belongs to what is sometimes called “exotic algebras” or “top- 
ical algebras”. This is a set of mathematical results, often with high description 
complexity, that give insights into man-made systems such as concurrent programs, 
digital circuits and, of course, communication networks. Petri nets fall into this fam- 
ily as well. For a general discussion of this promising area, see the overview paper 
[30] and the book [24]. 

We hope to convince many readers that there is a whole set of largely unex- 
plored, fundamental relations that can be obtained with the methods used in this 
book. Results such as “shapers keep arrival constraints” or “pay bursts only once”, 
derived in Chapter 1 have physical interpretations and are of practical importance to 
network engineers. 

All results here are deterministic. Beyond this book, an advanced book on net- 
work calculus would explore the many relations between stochastic systems and the 
deterministic relations derived in this book. The interested reader will certainly en- 
joy the pioneering work in [24] and [10]. The appendix contains an index of the 
terms dehned in this book. 



Network Calculus, a System Theory for Computer Net- 
works 

In the rest of this introduction we highlight the analogy between network calculus 
and what is called “system theory”. You may safely skip it if you are not familiar 
with system theory. 

Network calculus is a theory of deterministic queuing systems found in com- 
puter networks. It can also be viewed as the system theory that applies to computer 
networks. The main difference with traditional system theory, as the one that was so 
successfully applied to design electronic circuits, is that here we consider another al- 
gebra, where the operations are changed as follows: addition becomes computation 
of the minimum, multiplication becomes addition. 

Before entering the subject of the book itself, let us briefly illustrate some of the 
analogies and differences between min-plus system theory, as applied in this book 
to communication networks, and traditional system theory, applied to electronic cir- 
cuits. 

Let us begin with a very simple circuit, such as the RC cell represented in Fig- 
ure 1. If the input signal is the voltage x{t) € IR, then the output y(t) € IR of this 
simple circuit is the convolution of x by the impulse response of this circuit, which 
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is here h{t) = exp{—t/RC)/RC fort > 0: 

y{t) = (/i ® x){t) = ( h{t — s)x{s)ds. 

JO 

Consider now a node of a communication network, which is idealized as a 
(greedy) shaper. A (greedy) shaper is a device that forces an input flow x{t) to have 
an output y{t) that conforms to a given set of rates according to a traffic envelope a 
(the shaping curve), at the expense of possibly delaying bits in the buffer. Here the 
input and output ‘signals’ are cumulative flow, defined as the number of bits seen on 
the data flow in time interval [0, f]. These functions are non-decreasing with time t. 
Parameter t can be continuous or discrete. We will see in this book that x and y are 
linked by the relation 



yit) 



{a ® x){t) 



inf |(T(f — s) -h a;(s)| . 

sEM. such that 0<s<t 



This relation defines the min-plus convolution between a and x. 




x(t) 

► 




(b) 



Figure 1; An RC circuit (a) and a greedy shaper (b), which are two elementary 
linear systems in their respective algebraic structures. 

Convolution in traditional system theory is both commutative and associative, 
and this property allows to easily extend the analysis from small to large scale cir- 
cuits. For example, the impulse response of the circuit of Figure 2(a) is the convo- 
lution of the impulse responses of each of the elementary cells: 

h{t) = {hi ^ h2){t) = ( hi{t - s)h2{s)ds. 

Jo 

The same property applies to greedy shapers, as we will see in Chapter 1. The 
output of the second shaper of Figure 2(b) is indeed equal to y{t) = (<t ® x){t), 
where 
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(r(i) = (<Ti ® (T 2 )(i) = inf {o'i(f - s) + o' 2 (s)} • 

sGR such that 0<s<i 

This will lead us to understand the phenomenon known as “pay burst only once” 
already mentioned earlier in this introduction. 



•JWg 



X(t) 

-o|- 



hz 



y(t) 



(a) 




Figure 2: The impulse response of the concatenation of two linear circuit is 
the convolution of the individual impulse responses (a), the shaping curve of 
the concatenation of two shapers is the convolution of the individual shaping 
curves (b). 

There are thus clear analogies between “conventional” circuit and system theory, 
and network calculus. There are however important differences too. 

A first one is the response of a linear system to the sum of the inputs. This is 
a very common situation, in both electronic circuits (take the example of a linear 
low-pass filter used to clean a signal x{t) from additive noise n{t), as shown in 
Figure 3(a)), and in computer networks (take the example a link of a buffered node 
with output link capacity C, where one flow of interest x{t) is multiplexed with 
other background traffic n{t), as shown in Figure 3(b)). 

Since fhe elecfronic circuit of Figure 3(a) is a linear system, the response to the 
sum of two inputs is the sum of the individual responses to each signal. Call y{t) 
the response of the system to the pure signal x{t), yn{t) the response to the noise 
n{t), and ytot{t) the response to the input signal corrupted by noise x{t) + n{t). 
Then ytot{t) = y{t) + Unit) - This useful property is indeed exploited to design the 
optimal linear system that will filter out noise as much as possible. 

If traffic is served on the outgoing link as soon as possible in the FIFO order, the 
node of Figure 3(b) is equivalent to a greedy shaper, with shaping curve a{t) = Ct 
for t > 0. It is therefore also a linear system, but this time in min-plus algebra. 
This means that the response to the minimum of two inputs is the minimum of the 
responses of the system to each input taken separately. However, this also mean 
that the response to the sum of two inputs is no longer the sum of the responses of 
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Figure 3: The response ytot{t) of a linear circuit to the sum of two inputs 
a; + n is the sum of the individual responses (a), but the response ytot{t) of 
a greedy shaper to the aggregate of two input flows a; + n is not the sum of 
the individual responses (b). 



the system to each input taken separately, because now x{t) + n{t) is a nonlinear 
operation between the two inputs x{t) and n(t): it plays the role of a multiplication 
in conventional system theory. Therefore the linearity property does unfortunately 
not apply to the aggregate x{t) + n(f). As a result, little is known on the aggregate 
of multiplexed flows. Chapter 6 will learn us some new results and problems that 
appear simple but are still open today. 

In both electronics and computer networks, nonlinear systems are also fre- 
quently encountered. They are however handled quite differently in circuit theory 
and in network calculus. 

Consider an elementary nonlinear circuit, such as the BIT amplifier circuit with 
only one transistor, shown in Figure 4(a). Electronics engineers will analyze this 
nonlinear circuit by first computing a static operating point y * for the circuit, when 
the input x* is a fixed constant voltage (this is the DC analysis). Next they will 
linearize the nonlinear element (i.e the transistor) around the operating point, to 
obtain a so-called small signal model, which a linear model of impulse response h{t) 
(this is the AC analysis). Now xun (t) = x{t)— x* is a time varying function of time 
within a small range around x*, so that yun{t) = y{t) — y* is indeed approximately 
given by yun{t) ~ (/i ® xun){t). Such a model is shown on Figure 4(b). The 
difficulty of a thorough nonlinear analysis is thus bypassed by restricting the input 
signal in a small range around the operating point. This allows to use a linearized 
model whose accuracy is sufficient to evaluate performance measures of interest, 
such as the gain of the amplifier. 

In network calculus, we do not decompose inputs in a small range time-varying 
part and another large constant part. We do however replace nonlinear elements by 
linear systems, but the latter ones are now a lower bound of the nonlinear system. We 
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Controller 





(c) 



(d) 



Figure 4: An elementary nonlinear circuit (a) replaced by a (simplified) linear 
model for small signals (b), and a nonlinear network with window flow control 
(c) replaced by a (worst-case) linear system (d). 



will see such an example with the notion of service curve, in Chapter 1 : a nonlinear 
system y(f) = II (a;) (f) is replaced hy a linear system (f) = (/3®a;)(f), where /3 
denotes this service curve. This model is such that yun{t) < y{t) for all t >0, and 
all possible inputs x{t). This will also allow us to compute performance measures, 
such as delays and backlogs in nonlinear systems. An example is the window flow 
controller illustrated in Figure 4(c), which we will analyze in Chapter 4. A flow x is 
fed via a window flow controller in a network that realizes some mapping y = II (a;) . 
The window flow controller limits the amount of data admitted in the network in 
such a way that the total amount of data In transit in the network is always less than 
some positive number (the window size). We do not know the exact mapping II, 
we assume that we know one service curve /3 for this flow, so that we can replace 
the nonlinear system of Figure 4(c) by the linear system of Figure 4(d), to obtain 
deterministic bounds on the end-to-end delay or the amount of data in transit. 

The reader familiar with traditional circuit and system theory will discover many 
other analogies and differences between the two system theories, while reading this 
book. We should Insist however that no prerequisite in system theory is needed to 
discover network calculus as it is exposed in this book. 
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Chapter 1 



Network Calculus 



In this chapter we introduce the basic network calculus concepts of arrival, ser- 
vice curves and shapers. The application given in this chapter concerns primarily 
networks with reservation services such as ATM or the Internet integrated services 
(“Intserv”). Applications to other settings are given in the following chapters. 



We begin the chapter by defining cumulative functions, which can handle both 
continuous and discrete time models. We show how their use can give a first insight 
into playout buffer issues, which will be revisited with more detail in Chapter 5. 
Then the concepts of Leaky Buckets and Generic Cell Rate algorithms are described 
in the appropriate framework, of arrival curves. We address in detail the most im- 
portant arrival curves: piecewise linear functions and stair functions. Using the stair 
functions, we clarify the relation between spacing and arrival curve. 



We introduce the concept of service curve as a common model for a variety 
of network nodes. We show that all schedulers generally proposed for ATM or the 
Internet integrated services can be modeled by a family of simple service curves 
called the rate-latency service curves. Then we discover physical properties of net- 
works, such as “pay bursts only once” or “greedy shapers keep arrival constraints”. 
We also discover that greedy shapers are min-plus, time invariant systems. Then we 
introduce the concept of maximum service curve, which can be used to account for 
constant delays or for maximum rates. We illustrate all along the chapter how the 
results can be used for practical buffer dimensioning. We give practical guidelines 
for handling fixed delays such as propagation delays. We also address the distortions 
due to variability in packet size. 
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CHAPTER 1. NETWORK CALCULUS 



1.1 Models for Data Flows 

1.1.1 Cumulative Functions, Discrete Time versus Continuous 
Time Models 

It is convenient to describe data flows by means of the cumulative function R{t), 
deflned as the number of bits seen on the flow in time interval [0, f] . By convention, 
we take i?(0) = 0, unless otherwise specified. Function R is always wide-sense 
increasing, that is, it belongs to the space T deflned in Section 3.1.3 on Page 128. 
We can use a discrete or continuous time model. In real systems, there is always a 
minimum granularity (hit, word, cell or packet), therefore discrete time with a finite 
set of values for R(T) could always be assumed. However, it is often computationally 
simpler to consider continuous time, with a function R that may be continuous or 
not. If R(T) is a continuous function, we say that we have a fluid model. Otherwise, 
we take the convention that the function is either right or left-continuous (this makes 
little difference in practice).' Figure 1.1.1 illustrates these deflnitions. 



Convention: A flow is described by a wide-sense increasing function un- 

less otherwise specified, in this book, we consider the following types of models: 

• discrete time: f e N = {0, 1, 2, 3, ...} 

• fluid model: t € = [0, -boo) and i? is a continuous function 

• general, continuous time model: t € IR“'' and i? is a left- or right-continuous 
function 



If we assume that R{t) has a derivative ^ = r(t) such that R(t) = r{s)ds 
(thus we have a fluid model), then r is called the rate function. Here, however, we 
will see that it is much simpler to consider cumulative functions such as R rather 
than rate functions. Contrary to standard algebra, with min-plus algebra we do not 
need functions to have “nice” properties such as having a derivative. 

It is always possible to map a continuous time model R{t) to a discrete time 
model S{n),n € N by choosing a time slot S and sampling by 

S{n) = R{nS) (1.1) 

In general, this results in a loss of information. For the reverse mapping, we use the 
following convention. A continuous time model can he derived from S{n),n € N 
hy letting^ 

= ( 1 - 2 ) 

^It would be nice to stick to either left- or right-continuous functions. However, depending on the 
model, there is no best choice: see Section 1.2.1 and Section 1.7 

^ [(c] (“ceiling of x”) is defined as the smallest integer > x; for example [2.3] = 3 and [2] = 2 
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R3* 



1 2 3 4 5 6 7 8 9 10 11 12 13 14 



Figure 1.1: Examples of Input and Output functions, illustrating our terminol- 
ogy and convention. and show a continuous function of continuous 
time (fluid model); we assume that packets arrive bit by bit, for a duration 
of one time unit per packet arrival. R 2 and show continuous time with 
discontinuities at packet arrival times (times 1 , 4, 8, 8.6 and 14); we assume 
here that packet arrivals are observed only when the packet has been fully 
received; the dots represent the value at the point of discontinuity; by con- 
vention, we assume that the function is left- or right-continuous. R 3 and 
show a discrete time model; the system is observed only at times 0, 1, 2... 
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The resulting function R' is always left-continuous, as we already required. Fig- 
ure 1.1.1 illustrates this mapping with S = 1, S = Rz and R' = i? 2 - 

Thanks to the mapping in Equation (1.1), any result for a continuous time model 
also applies to discrete time. Unless otherwise stated, all results in this book apply 
to both continuous and discrete time. Discrete time models are generally used in 
the context of ATM; in contrast, handling variable size packets is usually done with 
a continuous time model (not necessarily fluid). Note that handling variable size 
packets requires some specific mechanisms, described in Section 1.7. 

Consider now a system S, which we view as a blackbox; S receives input data, 
described by its cumulative function R{t), and delivers the data after a variable 
delay. Call R*{t) the output function, namely, the cumulative function at the output 
of system S. System S might be, for example, a single buffer served at a constant 
rate, a complex communication node, or even a complete network. Figure 1.1.1 
shows input and output functions for a single server queue, where every packet 
takes exactly 3 time units to be served. With output function R j (fluid model) the 
assumption is that a packet can be served as soon as a first bit has arrived (cut- 
through assumption), and that a packet departure can be observed bit by bit, at a 
constant rate. For example, the first packet arrives between times 1 and 2, and leaves 
between times 1 and 4. With output function i ?2 the assumption is that a packet is 
served as soon as it has been fully received and is considered out of the system only 
when it is fully transmitted (store and forward assumption). Here, the first packet 
arrives immediately after time 1, and leaves immediately after time 4. With output 
function i?g (discrete time model), the first packet arrives at time 2 and leaves at 
time 5. 

1.1.2 Backlog and Virtual Delay 

From the input and output functions, we derive the two following quantities of in- 
terest. 

Definition 1.1.1 (Backlog and Delay). For a lossless system: 

• The backlog at time t is R{t) — R*{t). 

• The virtual delay at time t is 

d{f) = inf {r > 0 : R{f) < R*{t + r)} 

The backlog is the amount of bits that are held inside the system; if the system 
is a single buffer, it is the queue length. In contrast, if the system is more complex, 
then the backlog is the number of bits “in transit”, assuming that we can observe 
input and output simultaneously. The virtual delay at time t is the delay that would 
be experienced by a bit arriving at time t if all bits received before it are served 
before it. In Figure 1. 1. 1, the backlog, called x{t), is shown as the vertical deviation 
between input and output functions. The virtual delay is the horizontal deviation. If 
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the input and output function are continuous (fluid model), then it is easy to see that 
R* {t + d{t)) = R{t), and that d{t) is the smallest value satisfying this equation. 

In Figure 1 . 1 . 1 , we see that the values of backlog and virtual delay slightly differ 
for the three models. Thus the delay experienced by the last bit of the first packet is 
d{2) = 2 time units for the first subfigure; in contrast, it is equal to d(l) = 3 time 
units on the second subfigure. This is of course in accordance with the different 
assumptions made for each of the models. Similarly, the delay for the fourth packet 
on subfigure 2 is d{8.6) = 5.4 time units, which corresponds to 2.4 units of waiting 
time and 3 units of service time. In contrast, on the third subfigure, it is equal to 
d{9) = 6 units; the difference is the loss of accuracy resulting from discretization. 



1.1.3 Example: The Playout Buffer 

Cumulative functions are a powerful tool for studying delays and buffers. In order 
to illustrate this, consider the simple playout buffer problem that we describe now. 
Consider a packet switched network that carries bits of information from a source 
with a constant bit rate r (Figure 1.2) as is the case for example, with circuit em- 
ulation. We take a fluid model, as illustrated in Figure 1.2. We have a first system 
S, the network, with input function R{t) = rt. The network imposes some variable 
delay, because of queuing points, therefore the output R * does not have a constant 
rate r. What can be done to recreate a constant bit stream ? A standard mechanism 




(D2) 



R*(t) 




tl 



Figure 1.2: A Simple Playout Buffer Example 



is to smooth the delay variation in a playout buffer. It operates as follows. When the 
first bit of data arrives, at time dr{0), where dr{0) = limj_j.o,t>o d{t) is the limit to 
the right of function d ^ , it is stored in the buffer until a fixed time A has elapsed. 
Then the buffer is served at a constant rate r whenever it is not empty. This gives us 
a second system S', with input R* and output S. 

Let us assume that the network delay variation is bounded by A. This implies 
that for every time t, the virtual delay (which is the real delay in that case) satisfies 

^It is the virtual delay for a hypothetical bit that would arrive just after time 0. Other authors often 
use the notation d(O-h) 
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—A < d{t) — dr{0) < A 

Thus, since we have a fluid model, we have 

r{t - dr{0) - A) < R*{t) < r{t - dr{0) + A) 

which is illustrated in the flgure by the two lines (Dl) and (D2) parallel to R{t) . The 
flgure suggests that, for the play out buffer 5' the input function R* is always above 
the straight line (D2), which means that the playout buffer never underflows. This 
suggests in turn that the output function S{t) is given by S{t) = r{t — dr{0) — A). 

Formally, the proof is as follows. We proceed by contradiction. Assume the 
buffer starves at some time, and let f i be the first time at which this happens. Clearly 
the playout buffer is empty at time 1 1 , thus R*{ti) = S{ti). There is a time interval 
[ti ,ti+e] during which the number of bits arriving at the playout buffer is less than 
re (see Figure 1.2. Thus, d{t\ + e) > dr{0) + A which is not possible. Secondly, 
the backlog in the buffer at time t is equal to R* (t) — S (t) , which is bounded by the 
vertical deviation between (Dl) and (D2), namely, 2rA. 

We have thus shown that the playout buffer is able to remove the delay variation 
imposed by the network. We summarize this as follows. 

Proposition 1.1.1. Consider a constant bit rate stream of rate r, modified by a 
network that imposes a variable delay variation and no loss. The resulting flow 
is put into a playout buffer, which operates by delaying the first bit of the flow by 
A, and reading the flow at rate r. Assume that the delay variation imposed by the 
network is bounded by A, then 

1. the playout buffer never starves and produces a constant output at rate r; 

2. a buffer size o/2Ar is sufficient to avoid overflow. 

We study playout buffers in more details in Chapter 5, using the network calcu- 
lus concepts further introduced in this chapter. 

1.2 Arrival Curves 

1.2.1 Definition of an Arrival Curve 

Assume that we want to provide guarantees to data flows. This requires some spe- 
cific support in the network, as explained in Section 1 .3; as a counterpart, we need to 
limit the traffic sent by sources. With integrated services networks (ATM or the in- 
tegrated services internet), this is done by using the concept of arrival curve, defined 
below. 

Definition 1.2.1 (Arrival Curve). Given a wide-sense increasing function a de- 
fined for f > 0 {namely, a € T), we say that a flow R is constrained by a if and 
only if for all s < t: 

R(t) — R(s) < a(t — s) 

We say that R has a as an arrival curve, or also that R is a-smooth. 
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Note that the condition is over a set of overlapping intervals, as Figure 1.3 illus- 
trates. 




Figure 1.3: Example of Constraint by arrival curve, showing a cumulative 
function R{t) constrained by the arrival curve a{t). 



Affine Arrival Curves: For example, if a{t) = rt, then the constraint means 

that, on any time window of width r, the number of bits for the flow is limited by 
rr. We say in that case that the flow is peak rate limited. This occurs if we know 
that the flow is arriving on a link whose physical bit rate is limited by r b/s. A flow 
where the only constraint is a limit on the peak rate is often (improperly) called a 
“constant bit rate” (CBR) flow, or “deterministic bit rate” (DBR) flow. 

Flaving a{t) = b, with b a constant, as an arrival curve means that the maximum 
number of bits that may ever be sent on the flow is at most b. 

More generally, because of their relationship with leaky buckets, we will often 
use affine arrival curves 7 ^, 6 , defined by: 'Yr,b{t) = rt + b for t > 0 and 0 otherwise 
(see Section 3.1.3 for an illustration). Having 7^,6 as an arrival curve allows a source 
to send b bits at once, but not more than r b/s over the long run. Parameters b and r 
are called the burst tolerance (in units of data) and the rate (in units of data per time 
unit). Figure 1.3 illustrates such a constraint. 

Stair Functions as Arrival Curves: In the context of ATM, we also use arrival 

curves of the form kvT,r, where vt,t is the stair functions defined by VT,r{t) = 
for t > 0 and 0 otherwise (see Section 3.1.3 for an illustration). Note that 
VT,T{t) = VT,o{t + t ), thus vt,t results from vt,o by a time shift to the left. Param- 
eter T (the “interval”) and r (the “tolerance”) are expressed in time units. In order to 
understand the use of vt,t, consider a flow that sends packets of a fixed size, equal 
to k unit of data (for example, an ATM flow). Assume that the packets are spaced by 
at least T time units. An example is a constant bit rate voice encoder, which gener- 
ates packets periodically during talk spurts, and is silent otherwise. Such a flow has 
kvT,o as an arrival curve. 
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Assume now that the flow is multiplexed with some others. A simple way to 
think of this scenario is to assume that the packets are put into a queue, together 
with other flows. This is typically what occurs at a workstation, in the operating 
system or at the ATM adapter. The queue imposes a variable delay; assume it can 
be bounded by some value equal to r time units. We will see in the rest of this 
chapter and in Chapter 2 how we can provide such bounds. Call R{t) the input 
function for the flow at the multiplexer, and R * (t) the output function. We have 
R*{s) < R{s — t), from which we derive: 

R*(t) — R*(s) < R(t) — R(s — t) < kvT,o(t — s + t) = kvT,T(t — s) 

Thus R* has kvr,T as an arrival curve. We have shown that a periodic flow, with 
period T, and packets of constant size k, that suffers a variable delay < r, has 
kvr,T CIS an arrival curve. The parameter r is often called the “one-point cell delay 
variation”, as it corresponds to a deviation from a periodic flow that can be observed 
at one point. 

In general, function vt,t can be used to express minimum spacing between pack- 
ets, as the following proposition shows. 

Proposition 1.2.1 (Spacing as an arrival constraint). Consider a flow, with cumu- 
lative function R{t), that generates packets of constant size equal to k data units, 
with instantaneous packet arrivals. Assume time is discrete or time is continuous 
and R is left-continuous. Call t„ the arrival time for the nth packet. The following 
two properties are equivalent: 

1. for all TO, n, tm-\-n ~ tm > nT — t 

2. the flow has kvx,T os an arrival curve 

The conditions on packet size and packet generation mean that R(f) has the form 
nk, with n e N. The spacing condition implies that the time interval between two 
consecutive packets is > T — r, between a packet and the next but one is > 2T — t, 
etc. 

Proof: Assume that property 1 holds. Consider an arbitrary interval ] s , f] , and call 

n the number of packet arrivals in the interval. Say that these packets are numbered 
TO -I- 1, . . . , TO -I- n, so that s < fm-i-i < • • • < tm-vn < C from which we have 

f S tra-\-n fm-t-1 

Combining with property 1 , we get 

t — s > (n — 1)T — r 

From the definition of vt,t it follows that vx.rit — s) > n. Thus R{f) — R{s) < 
kvT,r{t — s)> which shows the first part of the proof. 

Conversely, assume now that property 2 holds. If time is discrete, we convert the 
model to continuous time using the mapping in Equation 1.2, thus we can consider 
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that we are in the continuous time case. Consider some arbitrary integers to, n; for 
all e > 0, we have, under the assumption in the proposition; 

R{tm+n + e) - R{tm) > {n + l)k 
thus, from the definition of vt,t, 

Im+n Im "h € ^ riT' T 

This is true for all e > 0, thus tm+n ~ tm > nT — t. □ 

In the rest of this section we clarify the relationship between arrival curve con- 
straints defined by affine and by stair functions. First we need a technical lemma, 
which amounts to saying that we can always change an arrival curve to be left- 
continuous. 

Lemma 1.2.1 (Reduction to left-continuous arrival curves). Consider a flow 
R{t) and a wide sense increasing function a{t), defined for t > 0. Assume that 
R is either left-continuous, or right-continuous. Denote with aflt) the limit to the 
left of a at t (this limit exists at every point because a is wide sense increasing); we 
have ai(t) = sup^^j a(s). If a is an arrival curve for R, then so is ai. 

Proof: Assume first that R is left-continuous. For some s < t, let be a sequence 

of increasing times converging towards t, with s <tn < t. We have —i?(s) < 
a{tn — s) < ai{t — s). Now lim„_j._|_oo R{tn) = R{t) since we assumed that R is 
left-continuous. Thus R{t) — R{s) < aflt — s). 

If in contrast R is right-continuous, consider a sequence s „ converging towards 
s from above. We have similarly R{t) — R{sn) < ct{t — Sn) < ctflt — s) and 
lim„_j._|_oo R{sn) = R{s), thus R{f) — R{s) < aflt — s) as well. □ 

Based on this lemma, we can always reduce an arrival curve to be left-continuous 
Note that and vt,t are left-continuous. Also remember that, in this book, we use 
the convention that cumulative functions such as R(t) are left continuous; this is a 
pure convention, we might as well have chosen to consider only right-continuous 
cumulative functions. In contrast, an arrival curve can always be assumed to be left- 
continuous, but not right-continuous. 

In some cases, there is equivalence between a constraint defined by jj.,b and 
vt,t- For example, for an ATM flow (namely, a flow where every packet has a fixed 
size equal to one unit of data) a constraint jj.,b with r = ^ and 6 = 1 is equivalent 
to sending one packet every T time units, thus is equivalent to a constraint by the 
arrival curve vt,o ■ In general, we have the following result. 

Proposition 1.2.2. Consider either a left- or right- continuous flow R{f),t € ffi"*", 
or a discrete time flow R(t), f € N, that generates packets of constant size equal to 
k data units, with instantaneous packet arrivals. For some T and t, let r = ^ and 
h = k{^ V). It is equivalent to say that R is constrained by 'jr,b or by kvT,r- 

'*If we consider ar(t), the limit to the right of a at t, then a < ar thus ar is always an arrival curve, 
however it is not better than a. 
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Proof: Since we can map any discrete time flow to a left-continuous, continuous 

time flow, it is sufficient to consider a left-continuous flow R{t),t € . Also, by 

changing the unit of data to the size of one packet, we can assume without loss of 
generality that k = 1. Note first, that with the parameter mapping in the proposition, 
we have vt,t < 7r,b, which shows that if vt,t is an arrival curve for R, then so is 
Tr^b- 

Conversely, assume now that R has ^^,b as an arrival curve. Then for all s < t, 
we have R{t) — R{s) < rt + b, and since R{t) — R{s) € N, this implies R{t) — 
R{s) < [rt + b\ , Call a{t) the right handside in the above equation and apply 
Lemma 1.2.1. We have ai{t) = [rt + b — T] = UT,r(f)- D 

Note that the equivalence holds if we can assume that the packet size is constant 
and equal to the step size in the constraint kvT,r- In general, the two families of 
arrival curve do not provide identical constraints. For example, consider an ATM 
flow, with packets of size 1 data unit, that is constrained hy an arrival curve of the 
form kvx,T, for some A: > 1. This flow might result from the superposition of several 
ATM flows. You can convince yourself that this constraint cannot he mapped to a 
constraint of the form 'yr,b- We will come back to this example in Section 1.4.1. 

1.2.2 Leaky Bucket and Generic Cell Rate Algorithm 

Arrival curve constraints find their origins in the concept of leaky bucket and generic 
cell rate algorithms, which we describe now. We show that leaky buckets correspond 
to affine arrival curves 7 ^,;,, while the generic cell rate algorithm corresponds to stair 
functions vt,t- For flows of fixed size packets, such as ATM cells, the two are thus 
equivalent. 

Definition 1.2.2 (Leaky Bncket Controller). A Leaky Bucket Controller is a device 
that analyzes the data on a flow R(t) as follows. There is a pool (bucket) of fluid of 
size b. The bucket is initially empty. The bucket has a hole and leaks at a rate ofr 
units of fluid per second when it is not empty. 

Data from the flow R(t) has to pour into the bucket an amount of fluid equal 
to the amount of data. Data that would cause the bucket to overflow is declared 
non-conformant, otherwise the data is declared conformant. 

Figure 1.2.2 illustrates the definition. Fluid in the leaky bucket does not represent 
data, however, it is counted in the same unit as data. 

Data that is not able to pour fluid into the bucket is said to be “non-conformant” 
data. In ATM systems, non-conformant data is either discarded, tagged with a low 
priority for loss (“red” cells), or can be put in a buffer (buffered leaky bucket con- 
troller). With the Integrated Services Internet, non-conformant data is in principle 
not marked, but simply passed as best effort traffic (namely, normal IP traffic). 

We want now to show that a leaky bucket controller enforces an arrival curve 
constraint equal to ^^.b- We need the following lemma. 

Lemma 1.2.2. Consider a buffer served at a constant rate r. Assume that the buffer 
is empty at time 0. The input is described by the cumulative function R(t). If there 
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Figure 1.4: A Leaky Bucket Controller. The second part of the figure shows 
(In grey) the level of the bucket x{t) for a sample Input, with r = 0.4 kbits 
per time unit and b = 1.5 kbits. The packet arriving at time t = 8.6 is not 
conformant, and no fluid is added to the bucket. If b would be equal to 2 
kbits, then all packets would be conformant. 



is no overflow during [0, t], the buffer content at time t is given by 
x{t) = sup {R(t) — R(s) — r(t — s)} 

s-.s<t 



Proof: The lemma can be obtained as a special case of Corollary 1 .5.2 on page 40, 
however we give here a direct proof. First note that for all s such that s <t, {t — s)r 
is an upper bound on the number of bits output in ]s, f], therefore: 

R(t) — R(s) — x(t) + x(s) < (t — s)r 



Thus 

x(t) > R(t) — R(s) + x(s) — (t — s)r > R(t) — R(s) — (t — s)r 

which proves that x{t) > supg,gCf{R{t) — R{s) — r{t — s)}. 

Conversely, call to the latest time at which the buffer was empty before time t: 

to = sup{s ■ s < t, x(s) = 0} 

(If x{t) > 0 then to is the beginning of the busy period at time t). During ]to,t], the 
queue is never empty, therefore it outputs bit at rate r, and thus 

x(t) = x(to) + R(t) - R(to) - (t - to)r (1.3) 

We assume that R is left-continuous (otherwise the proof is a little more com- 
plex); thus x{to) = 0 and thus x{t) < sup g. g^^{R{t) — R{s) — r{t — s)} □ 

Now the content of a leaky bucket behaves exactly like a buffer served at rate 
r, and with capacity b. Thus, a flow R{t) is conformant if and only if the bucket 
content x{t) never exceeds b. From Lemma 1.2.2, this means that 
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sup {R{t) — R{s) — r{t — s)} < b 

s:s<t 

which is equivalent to 

R(t) — R(s) < r(t — s) +b 
for all s <t. We have thus shown the following. 

Proposition 1.2.3. A leaky bucket controller with leak rate r and bucket size b forces 
a flow to be constrained by the arrival curve yr,b< namely: 

1. the flow of conformant data has yr,b os on arrival curve; 

2. if the input already has yj.,b os an arrival curve, then all data is conformant. 

We will see in Section 1.4.1 a simple interpretation of the leaky bucket param- 
eters, namely: r is the minimum rate required to serve the flow, and b is the buffer 
required to serve the flow at a constant rate. 

Parallel to the concept of leaky bucket is the Generic Cell Rate Algorithm 
(GCRA), used with ATM. 

Definition 1.2.3 (GCRA (T, r)). The Generic Cell Rate Algorithm (GCRA) with 
parameters (T,t) is used with fixed size packets, called cells, and defines confor- 
mant cells as follows. It takes as input a cell arrival time t and returns result. It 
has an internal (static) variable tat (theoretical arrival time). 

• initially, tat = 0 

• when a cell arrives at time t, then 

if (t < tat - tau) 

result = NON- CONFORMANT; 
else { 

tat = max (t, tat) + T; 
result = CONFORMANT; 

} 



Table 1 . 1 illustrate the definition of GCRA. It illustrates that ^ is the long term 
rate that can be sustained by the flow (in cells per time unit); while r is a tolerance 
that quantifies how early cells may arrive with respect to an ideal spacing of T 
between cells. We see on the first example that cells may be early by 2 time units 
(cells arriving at times 18 to 48), however this may not be cumultated, otherwise the 
rate of ^ would be exceeded (cell arriving at time 57). 

In general, we have the following result, which establishes the relationship be- 
tween GCRA and the stair functions vt,t- 

Proposition 1.2.4. Consider a flow, with cumulative function R(t), that generates 
packets of constant size equal to k data units, with instantaneous packet arrivals. 
Assume time is discrete or time is continuous and R is left-continuous. The following 
two properties are equivalent: 
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Table 1.1: Examples for GCRA(10,2). The table gives the cell arrival times, 
the value of the tat internal variable just before the cell arrival, and the 
result for the cell (c = conformant, non-c = non-conformant). 

1. the flow is conformant to GCRA(T,t) 

2. the flow has {k vt,t) tm arrival curve 

Proof: The proof uses max-plus algebra. Assume that property 1 holds. Denote 

with 6n the value of tat just after the arrival of the nth packet (or cell), and by 
convention = 0- Also call the arrival time of the nth packet. From the defini- 
tion of the GCRA we have = max(f„, 0„_i) -|- T. We write this equation for all 
m <n, using the notation V for max. The distributivity of addition with respect to 
V gives: 

r = ( 0„_1 + T) V (f„ + T) 

I 0„_1 + T = {dn-2 + 2T) V (f„_i + 2T) 

[ 6»i -b (n - 1)T = (6»o + nT) V (G -b nT) 

Note that {Oq + nT) V -b nT) = ti + nT because = 0 and ti > 0, thus 
the last equation can be simplified to 0i + (n — 1)T = ti + nT. Now the iterative 
substitution of one equation into the previous one, starting from the last one, gives 

Sn = (tn +T) V {fn-1 + 2T) V . . . V (fi -b nT) (1-4) 

Now consider the (to -b n)th arrival, for some to, n € N, with to > 1. By property 
1, the packet is conformant, thus 

tm+n ^ ^m-t-n — 1 f (1-5) 

Now from Equation (1.4), > fj -b (ni-bn — j)T for all 1 <j <m + n — l. 

For j = TO, we obtain 6m+n-i > + nT. Combining this with Equation (1.5), 

we have tm+n > + nT — t. With proposition 1.2.1, this shows property 2. 

Conversely, assume now that property 2 holds. We show by induction on n that 
the nth packet is conformant. This is always true for n = 1. Assume it is true for 
all TO < n. Then, with the same reasoning as above, Equation (1.4) holds for n. We 
rewrite it as = maxi<j<„{fj -b (n — j -b 1)T). Now from proposition 1.2.1, 
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tn+i > tj + {n—j + 1)T — r for all 1 < j < n, thus tn+i > maxi<j<„{fj + (n — 
j + 1)T} — T. Combining the two, we find that tn+i > On ~ t, thus the (n + l)th 
packet is conformant. □ 

Note the analogy between Equation (1.4) and Lemma 1 .2.2. Indeed, from propo- 
sition 1 . 2 . 2 , for packets of constant size, there is equivalence between arrival con- 
straints by affine functions 7 ^,;, and by stair functions vt,t- This shows the following 
result. 

Corollary 1.2.1. For a flow with packets of constant size, satisfying the GCRA(T, r) 
is equivalent to satisfying a leaky bucket controller, with rate r and burst tolerance 
b given by: 

b = {^ + l)S 

S 

1’ = zz: 

T 

In the formulas, <5 is the packet size in units of data. 

The corollary can also be shown by a direct equivalence of the GCRA algorithm 
to a leaky bucket controller. 

Take the ATM cell as unit of data. The results above show that for an ATM cell 
flow, being conformant to GCRA(T, r) is equivalent to having vt,t as an arrival 
curve. It is also equivalent to having 7 ^,;, as an arrival curve, with r = -^ and b = 
^ + 1 . 

Consider a family of I leaky bucket controllers (or GCRAs), with parameters 
ri,bi, for 1 < i < 7. If we apply all of them in parallel to the same flow, then the 
conformant data is data that is conformant for each of the controllers in isolation. 
The flow of conformant data has as an arrival curve 

a{t) = min ( 7 ^,, ft, (f)) = min (nt + bi) 

1 < 7 </ 

It can easily be shown that the family of arrival curves that can be obtained in 
this way is the set of concave, piecewise linear functions, with a flnite number of 
pieces. We will see in Section 1.5 some examples of functions that do not belong to 
this family. 

Application to ATM and the Internet Leaky buckets and GCRA are used by 
standard bodies to define conformant flows in Integrated Services Networks. With 
ATM, a constant bit rate connection (CBR) is defined by one GCRA (or equiva- 
lently, one leaky bucket), with parameters (T, r). T is called the ideal cell interval, 
and T is called the Cell Delay Variation Tolerance (CDVT). Still with ATM, a vari- 
able bit rate (VBR) connection is defined as one connection with an arrival curve 
that corresponds to 2 leaky buckets or GCRA controllers. The Integrated services 
framework of the Internet (Intserv) uses the same family of arrival curves, such as 



a{t) = min(M + pt, rt + b) 



( 1 - 6 ) 
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where M is interpreted as the maximum packet size, p as the peak rate, b as the burst 
tolearance, and r as the sustainable rate (Figure 1.5). In Intserv jargon, the 4-uple 
(p, M, r, b) is also called a T-SPEC (traffic specification). 




Figure 1.5: Arrival curve for ATM VBR and for Intserv flows 



1.2.3 Sub-additivity and Arrival Curves 

In this Section we discover the fundamental relationship between min-plus algebra 
and arrival curves. Let us start with a motivating example. 

Consider a flow R{t) G N with t € N; for example the flow is an ATM cell flow, 
counted in cells. Time is discrete to simplify the discussion. Assume that we know 
that the flow is constrained by the arrival curve 3t;io,o; for example, the flow is the 
superposition of 3 CBR connections of peak rate 0.1 cell per time unit each. Assume 
in addition that we know that the flow arrives at the point of observation over a link 
with a physical characteristic of 1 cell per time unit. We can conclude that the flow 
is also constrained by the arrival curve v\fi. Thus, obviously, it is constrained by 
a\ = min(3t;io,0 5 tn,o)- Figure 1.6 shows the function a i. 




Figure 1.6: The arrival curve ai = min(3t)io,o, t'l.o) on the left, and its sub- 
additive closure (“good” function) di on the right. Time is discrete, lines are 
put for ease of reading. 

Now the arrival curve a\ tells us that J?(10) < 3 and i?(ll) < 6. However, 
since there can arrive at most 1 cell per time unit , we can also conclude that 
i?(ll) < i?(10) -I- [i?(ll) - i?(10)] < q;i(10) -I- q;i( 1) = 4. In other words. 
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the sheer knowledge that R is constrained hy a i allows us to derive a better bound 
than ai itself. This is because ai is not a “good” function, in a sense that we define 
now. 

Definition 1.2.4. Consider a function a in calF. We say that a is a “ good” function 
if any one of the following equivalent properties is satisfied 

1. a is sub-additive and Q!(0) = 0 

2 . a = a® a 

3. a® a = a 

4. a = a (sub-additive closure of a). 

The definition uses the concepts of sub-additivity, min-plus convolution, min- 
plus deconvolution and sub-additive closure, which are defined in Chapter 3. The 
equivalence between the four items comes from Corollaries 3.1.1 on page 144 
and 3.1.13 on page 151. Sub-additivity (item 1) means that a{s -Pt) < a{s) -P a{t). 
If a is not sub-additive, then a(s) -I- a(t) may be a better bound than a(s -I- 1), as is 
the case with ai in the example above. Item 2, 3 and 4 use the concepts of min-plus 
convolution, min-plus deconvolution and sub-additive closure, defined in Chapter 3. 
We know in particular (Theorem 3.1.10) that the sub-additive closure of a function 
a is the largest “good” function a such that a < a. We also know that a G IF if 
a€ R. 

The main result about arrival curves is that any arrival curve can be replaced by 
its sub-additive closure, which is a “good” arrival curve. Figure 1.6 shows dii for 
our example above. 

Theorem 1.2.1 (Reduction of Arrival Curve to a Sub- Additive One). Saying that 
a flow is constrained by a wide -sense increasing function a is equivalent to saying 
that it is constrained by the sub-additive closure d. 

The proof of the theorem leads us to the heart of the concept of arrival curve, 
namely, its correspondence with a fundamental, linear relationships in min-plus al- 
gebra, which we will now derive. 

Lemma 1.2.3. A flow R is constrained by arrival curve a if and only if R < R® a 

Proof: Remember that an equation such as R < R® a means that for all times 

t, R{t) < (R ® a){t). The min-plus convolution R ® a is defined in Chapter 3, 
page 134; since R{s) and a{s) are defined only for s > 0, the definition of R® a 
is: (R ® a){t) = info<s<i (i?(s) -I- aft — s)). Thus R < R® ais equivalent to 
R(f) < R(s) -\- aft — s) for all 0 < s < t. □ 

Lemma 1.2.4. Ifa\ and a 2 are arrival curves for a flow R, then so is a\ ® a 2 
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Proof: We know from Chapter 3 that ai ® Q!2 is wide-sense increasing if ai 

and Q!2 are. The rest of the proof follows immediately from Lemma 1.2.3 and the 
associativity of □ 

Proof of Theorem Since a is an arrival curve, so is a ® a, and by iteration, so 
is for all n > 1. By the definition of So, it is also an arrival curve. Thus so is 

a = inf„>oQ!(”). 

Conversely, a < a\ thus, if a is an arrival curve, then so is a. □ 



Examples We should thus restrict our choice of arrival curves to sub-additive 
functions. As we can expect, the functions jj.,b and vt,t introduced in Section 1.2.1 
are sub-additive and since their value is 0 for f = 0, they are “good” functions, as 
we now show. Indeed, we know from Chapter 1 that any concave function a such 
that Q!(0) = 0 is sub-additive. This explains why the functions jj.,b are sub-additive. 

Functions vt,t are not concave, but they still are sub-additive. This is because, 
from its very definition, the ceiling function is sub-additive, thus 



VT,r(s+t) = r 1 < = VT,T{s)+VT,T{t) 



Let us return to our introductory example with ai = min(3uio,0 5 As we dis- 
cussed, ai is not sub-additive. From Theorem 1.2.1, we should thus replace a i by 
its sub-additive closure ai, which can be computed by Equation (3.13). The com- 
putation is simplified by the following remark, which follows immediately from 
Theorem 3.1.11: 



Lemma 1.2.5. Let 71 and 72 be two “good” functions. The sub-additive closure of 
min (71 , 72) is 71 ® 72. 

We can apply the lemma to ai = 3uio,o A v\fi, since vt,t is a “good” function. 
Thus cfi = 3uio,o ® which the alert reader will enjoy computing. The result is 
plotted in Figure 1.6. 

Finally, let us mention the following equivalence, the proof of which is easy and 
left to the reader. 

Proposition 1.2.5. For a given wide-sense increasing function a, with q:(0) = 0, 
consider a source defined by R(t) = a(t) (greedy source). The source has a as an 
arrival curve if and only if a is a “ good” function. 



VBR arrival curve Now let us examine the family of arrival curves obtained by 
combinations of leaky buckets or GCRAs (concave piecewise linear functions). We 
know from Chapter 3 that if 71 and 72 are concave, with 71 (0) = 72(0) = 0, 
then 7i ® 72 = 71 A 72 . Thus any concave piecewise linear function a such that 
q:( 0) = 0 is a “good” function. In particular, if we define the arrival curve for VBR 
connections or Intserv flows by 
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J a(t) = min(pt + M,rt + b) if i > 0 

\ a(0) = 0 

(see Figure 1.5) then a is a “good” function. 

We have seen in Lemma 1.2.1 that an arrival curve a can always be replaced by 
its limit to the left ai . We might wonder how this combines with the sub-additive 
closure, and in particular, whether these two operations commute (in other words, do 
we have (a)i = aj ?). In general, if a is left-continuous, then we cannot guarantee 
that a is also left-continuous, thus we cannot guarantee that the operations commute. 
However, it can be shown that (a)i is always a “good” function, thus (a)i = (a)i- 
Starting from an arrival curve a we can therefore improve by taking the sub-additive 
closure first, then the limit to the left. The resulting arrival curve (a) / is a “good” 
function that is also left-continuous (a “very good” function), and the constraint by 
a is equivalent to the constraint by (a) / 

Lastly, let us mention that it can easily be shown, using an argument of uniform 
continuity, that if a takes only a finite set of values over any bounded time interval, 
and if a is left-continuous, then so is a and then we do have (a)i = aj. This 
assumption is always true in discrete time, and in most cases in practice. 

1.2.4 Minimum Arrival Curve 

Consider now a given flow R{t), for which we would like to determine a minimal 
arrival curve. This problem arises, for example, when R is known from measure- 
ments. The following theorem says that there is indeed one minimal arrival curve. 

Theorem 1.2.2 (Minimum Arrival Curve). Consider a flow R{t)t>o- Then 

• function R 0 R is an arrival curve for the flow 

• for any arrival curve a that constrains the flow, we have: {R 0 R) < a 

• R 0 R is a “ good” function 

Function R0 R is called the minimum arrival curve, for flow R. 

The minimum arrival curve uses min-plus deconvolution, defined in Chapter 3. 
Figure 1.2.4 shows an example of i? 0 i? for a measured function R. 

Proof: By definition of 0 , we have {R 0 R){t) = sup^>o{i?(i -b u) — R{v)}, it 

follows that {R 0 R) is an arrival curve. 

Now assume that some a is also an arrival curve for R. From Lemma 1.2.3, we 
have R < R 0 a). From Rule 14 in Theorem 3.1.12 in Chapter 3, it follows that 
R 0 R < a, which shows that R 0 R is the minimal arrival curve for R. Lastly, 
i? 0 i? is a “good” function from Rule 15 in Theorem 3.1.12. □ 

Consider a greedy source, with R{t) = a{t), where a is a “good” function. 
What is the minimum arrival curve Lastly, the curious reader might wonder 

^Answer: from the equivalence in Definition 1.2.4, the minimum arrival curve is a itself. 
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Figure 1.7: Example of minimum arrival curve. Time is discrete, one time unit 
is 40 ms. The top figures shows, for two similar traces, the number of packet 
arrivals at every time slot. Every packet is of constant size (416 bytes). The 
bottom figure shows the minimum arrival curve for the first trace (top curve) 
and the second trace (bottom curve). The large burst in the first trace comes 
earlier, therefore its minimum arrival curve is slightly larger. 
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whether i? 0 i? is left-continuous. The answer is as follows. Assume that R is either 
right or left-continuous. By lemma 1.2.1, the limit to the left {R(Z) R)i is also an 
arrival curve, and is bounded from above by i? 0 i?. Since i? 0 i? is the minimum 
arrival curve, it follows that (i? 0 i?)/ = RO) R, thus R(Z) R is left-continuous (and 
is thus a “very good” function). 

In many cases, one is interested not in the absolute minimum arrival curve as 
presented here, but in a minimum arrival curve within a family of arrival curves, for 
example, among all 7 ^,;, functions. For a development along this line, see [54]. 



1.3 Service Curves 

1.3.1 Definition of Service Curve 

We have seen that one first principle in integrated services networks is to put arrival 
curve constraints on flows. In order to provide reservations, network nodes in return 
need to offer some guarantees to flows. This is done by packet schedulers [39]. The 
details of packet scheduling are abstracted using the concept of service curve, which 
we introduce and study in this section. Since the concept of service curve is more 
abstract than that of arrival curve, we introduce it on some examples. 

A first, simple example of a scheduler is a Generalized Processor Sharing (GPS) 
node [56]. We define now a simple view of GPS; more details are given in Chapter 2. 
A GPS node serves several flows in parallel, and we can consider that every flow is 
allocated a given rate. The guarantee is that during a period of duration t, for which 
a flow has some backlog in the node, it receives an amount of service at least equal 
to rt, where r is its allocated rate. A GPS node is a theoretical concept, which is 
not really implementable, because it relies on a fluid model, while real networks use 
packets. We will see in Section 2.1 on page 83 how to account for the difference 
between a real implementation and GPS. Consider a input flow R, with output R*, 
that is served in a GPS node, with allocated rate r. Let us also assume that the node 
buffer is large enough so that overflow is not possible. We will see in this section how 
to compute the buffer size required to satisfy this assumption. Lossy systems are the 
object of Chapter 9. Under these assumptions, for all time t, call to the beginning of 
the last busy period for the flow up to time t. From the GPS assumption, we have 

R*{t) - R*{to) > r{t - to) 

Assume as usual that R is left-continuous; at time to the backlog for the flow is 
0, which is expressed by R{to) — R*{to) = 0. Combining this with the previous 
equation, we obtain: 

R*{t) - R{to) > r{t - to) 

We have thus shown that, for all time t: R*{t) > info<s<i [i?(s) -I- r{t — s)j, which 
can be written as 



i?* > i? 0 Jr,0 



(1-7) 
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Note that a limiting case of GPS node is the constant bit rate server with rate r, 
dedicated to serving a single flow. We will study GPS in more details in Chapter 2. 

Consider now a second example. Assume that the only information we have 
about a network node is that the maximum delay for the bits of a given flow R is 
bounded by some fixed value T, and that the bits of the flow are served in first in, 
first out order. We will see in Section 1 .5 that this is used with a family of schedulers 
called “earliest deadline first” (EDF). We can translate the assumption on the delay 
bound to d{t) < T for all t. Now since R* is always wide-sense increasing, it 
follows from the definition of d{t) that R*{t + T) > R{t). Conversely, if R*{t + 
T) > R{t), then d{t) < T. In other words, our condition that the maximum delay 
is bounded by T is equivalent to R*{t + T) > R{t) for all t. This in turn can be 
re-written as 

R*{s) > R{s-T) 

for all s > T. We have introduced in Chapter 3 the “impulse” function St defined 
by (5T(f) = 0 if 0 < f < T and Sxit) = -foo if f > T. It has the property that, for 
any wide-sense increasing function x{t), defined fori < 0, (a; ® ST){t) = x{t — T) 
ift>T and {x ® ST){t) = a;(0) otherwise. Our condition on the maximum delay 
can thus be written as 

R*>R^St ( 1 . 8 ) 

For the two examples above, there is an input-output relationship of the same 
form (Equations (1.7) and (1.8)). This suggests the definition of service curve, 
which, as we see in the rest of this section, is indeed able to provide useful results. 



P(t) 

time 



Figure 1.8: Definition of service curve. The output R* must be above R® (i, 
which is the lower envelope of all curves t R{to) + (3{t — to). 




Definition 1.3.1 (Service Curve). Consider a system S and a flow through S with 
input and output function R and R* .We say that S offers to the flow a service curve 
if and only if € T and R* > R<^ (3 



Figure 1.8 illustrates the definition. 
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The definition means that /3 is a wide sense increasing function, with /3(0) = 0, 
and that for all f > 0, 



R*{t) > inf (R{s) + (3{t — s)) 

S<t 

In practice, we can avoid the use of an infimum if (i is continuous. The following 
proposition is an immediate consequence of Theorem 3.1.8 on Page 139. 

Proposition 1.3.1. If (3 is continuous, the service curve property means that for all 
t we can find to <t such that 

R*{t)>Ri{to)+l3{t-to) (1.9) 

where Ri(to) = -R(s) ^ the limit to the left of R at to- If R is left- 

continuous, then Ri{to) = R{to)- 

For a constant rate server (and also for any strict service curve, the number f o in 
Equation (1.9) is the beginning of the busy period. For other cases, there is not such 
a simple definition. However, in some cases we can make sure that to increases with 
t: 

Proposition 1.3.2. If the service curve (3 is convex, then we can find some wide 
sense increasing function r{f) such that we can choose to = T{f) in Equation (1.9). 

Note that since a service curve is assumed to be wide-sense increasing, f3, being 
convex, is necessarily continuous; thus we can apply Proposition 1.3.1. 

Proof: We give the proof when R is left-continuous. The proof for the general 

case is essentially the same but involves some e cutting. Consider some t\ <t 2 and 
call Ti a value of to as in Equation (1.9)) at t = ti . Also consider any t' <t\. From 
the definition of t\ , we have 

R*(t')+^(ti-t')>R*{Ti)+^(ti-Ti) 



and thus 

R*(t') -b j3(t2 — t') > R*{ti) -b l3(ti — Ti) — l3(ti — t') -b I3(t2 — t') 

Now f3 is convex, thus for any four numbers a,b,c,d such that a<c<b,a<d<b 
and a -b 6 = c -b d, we have 

(3{a) -b (3{b) > 13(c) -b (3(d) 

(the interested reader will be convinced by drawing a small figure). Applying this to 
a = t\ — Ti,b = t 2 —t',c = t\ —t',d = t 2 — Ti gives 



R*(t')-i-l3(t2-t')>R*(Ti)-i-l3(t2-Ti) 
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and the above equation holds for all f ' < ti . Consider now the minimum, for a 
fixed t- 2 , of R*{t') + P{t 2 — t') over all t' < t 2 . The above equation shows that the 
minimum is reached for some t' >t\. □ 

We will see in Section 1.4 that the combination of a service curve guarantee 
with an arrival curve constraint forms the basis for deterministic bounds used in 
integrated services networks. Before that, we give the fundamental service curve 
examples that are used in practice. 

1.3.2 Classical Service Curve Examples 

Guaranteed Delay Node The analysis of the second example in Section 1.3.1 can 
be rephrased as follows. 

Proposition 1.3.3. For a lossless bit processing system, saying that the delay for 
any bit is bounded by some fixed T is equivalent to saying that the system offers to 
the flow a service curve equal to 6t- 

Non Premptive Priority Node Consider a node that serves two fiows, Rh{1) and 
RL{t). The first flow has non-preemptive priority over the second one (Figure 1.9). 
This example explains the general framework used when some traffic classes have 
priority over some others, such as with the Internet differentiated services [6]. The 
rate of the server is constant, equal to C. Call R*jj{t) and Rf{t) the outputs for 
the two flows. Consider first the high priority flow. Fix some time t and call s the 



Rn(t) 

Rr(t) 



High priority 

— ► 



Q- 



rate c 



Low priority 



RVt) 

R\(t) 



> 



Figure 1.9: Two priority fiows (H and L) served with a preemptive head of the 
line (HOL) service discipline. The high priority fiow is constrained by arrival 
curve a. 

beginning of the backlog period for high priority traffic. The service for high priority 
can be delayed by a low priority packet that arrived shortly before s ' , but as soon as 
this packet is served, the server is dedicated to high priority as long as there is some 
high priority traffic to serve. Over the interval (s, t], the output is C{t — s)Thus 

R*H{t)-RH{s)>C{t-s)-l^,^ 

where is the maximum size of a low priority packet. Now by definition of s: 
R}j{s) = Rh{s) thus 
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R*u{t) > Rh{s) + C{t - s)-l^ 



Now we have also 



R*H(t) - Rh{s) = R*H(t) - R*h(s) > 0 
from which we derive 



R*H{t)>RH{s) + [C{t-s)-l^,J+ 

The function u [Cu — is called the rate-latency function with rate C and 

latency [67] (in this hook we note it /3 , see also Figure 3.1 on page 130). 

C,-‘^ 

Thus the high priority traffic receives this function as a service curve. 

Now let us examine low priority traffic. In order to assure that it does not starve, 
we assume in such situations that the high priority flow is constrained by an arrival 
curve aH- Consider again some arbitrary time t. Call s' the beginning of the server 
busy period (note that s' < s). At time s', the backlogs for both flows are empty, 
namely, R’^(s') = Rh{s') and R*j^{s') = Rl(s'). Over the interval {s' ,t], the 
output is C{t — s'). Thus 

Rlit) - Rlis') = C{t - s') - [i?Jf(i) - i?Jf(s')] 

Now 

R*H{t) - R*h{s') = Rhit) - Rh{s') < Rnit) - Rh{s') < an{t - s') 

and obviously R}j{t) — R}j{s') > 0 thus 

Rl{t) - Rl{s') = Rl{t) - Rl(s') > S{t - s') 

with S(u) = (Cu — ajj (u))^. Thus, if S is wide-sense increasing, the low-priority 
flow receives a service curve equal to function S. Assume further that an = ^r,b, 
namely, the high priority flow is constrained by one single leaky bucket or GCRA. 
In that case, the service curve S{t) offered to the low-priority flow is equal to the 
rate-latency function (in^rit), with R = C — r and T = 

We have thus shown the following. 

Proposition 1.3.4. Consider a constant bit rate server, with rate C, serving two 
flows, H and L, with non-preemptive priority given to flow H. Then the high priority 
flow is guaranteed a rate-latency service curve with rate C and latency where 
^max maximum packet size for the low priority flow. 

If in addition the high priority flow is ^r,bStnooth, with r < C, then the low 

priority flow is guaranteed a rate-latency service curve with rate C — r and latency 
b 

C-r- 

This example justifies the importance of the rate-latency service curve. We will 
also see in Chapter 2 (Theorem 2.1.1 on page 87) that all practical implementations 
of GPS offer a service curve of the rate-latency type. 
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Strict service curve An important class of network nodes fits in the following 
framework. 

Definition 1.3.2 (Strict Service Curve). Wfe say that system S offers a strict service 
curve (3 to a flow if, during any backlogged period of duration u, the output of the 
flow is at least equal to fliu). 

A GPS node is an example of node that offers a strict service curve of the form 
fl{t) = rt. Using the same busy-period analysis as with the GPS example in the 
previous section, we can easily prove the following. 

Proposition 1.3.5. If a node offers (3 as a strict service curve to a flow, then it also 
offers (3 as a service curve to the flow. 

The strict service curve property offers a convenient way of visualizing the ser- 
vice curve concept: in that case, (3{u) is the minimum amount of service guaranteed 
during a busy period. Note however that the concept of service curve, as defined in 
Definition 1.3.1 is more general. A greedy shaper (Section 1.5.2) is an example of 
system that offers its shaping curve as a service curve, without satisfying the strict 
service curve property. In contrast, we will find later in the book some properties that 
hold only if a strict service curve applies. The framework for a general discussion 
of strict service curves is given in Chapter 7. 

Variable Capacity Node Consider a network node that offers a variable service 
capacity to a flow. In some cases, if is possible fo model the capacity by a cumu- 
lative function M(f), where M(f) is the total service capacity available to the flow 
between times 0 and t. For example, for an ATM system, think of M (t) as the num- 
ber of time slots between times 0 and t that are available for sending cells of the 
flow. Let us also assume that the node buffer is large enough so that overflow is nol 
possible. The following proposition is obvious buf importanf in practice 

Proposition 1.3.6. If the variable capacity satisfies a minimum guarantee of the 
form 

M{f)-M{s)>(3{t-s) (1.10) 

for some fixed function (3 and for all 0 < s < t, then (3 is a strict service curve. 

Thus f3 is also a service curve for that particular flow. The concepf of variable 
capacity node is also a convenient way to establish service curve properties. For an 
application to real time systems (rather than communication networks) see [70]. 

We will show in Chapter 4 that the output of the variable capacity node is given 
by 

R*{t)= ini {M{t) - M{s) + R{s)} 

0<s<t 

Lastly, coming back to the priority node, we have: 

Proposition 1.3.7. The service curve properties in Proposition 1.3.4 are strict. 

The proof is left to the reader. It relies on the fact that constant rate server is a 
shaper. 
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1.4 Network Calculus Basics 

In this section we see the main simple network calculus results. They are all hounds 
for lossless systems with service guarantees. 

1.4.1 Three Bounds 

The first theorem says that the backlog is bounded by the vertical deviation between 
the arrival and service curves: 

Theorem 1.4.1 (Backlog Bound). Assume a flow, constrained by arrival curve a, 
traverses a system that offers a service curve /). The backlog R{t) — R*{t) for all t 
satisfies: 

R(t) — R*(t) < sup{a:(s) — /3(s)} 

s>0 



Proof: The proof is a straightforward application of the definitions of service and 

arrival curves: 



R{t) - R*{t) < R{t) - inf [R{t - s) + /3(s)] 

0<s<t 

Thus 

R(t) — R*(t) < sup — R(t — s) + /3(s)] < sup [q:(s) + (3{t — s)] 

Q<s<t Q<s<t 



□ 

We now use the concept of horizontal deviation, defined in Chapter 3, Equa- 
tion (3.21). The definition is a little complex, but is supported by the following 
intuition. Call 

S(s) = inf {r > 0 : a(s) < /3(s -I- r)} 

From Definition 1.1.1, S(s) is the virtual delay for a hypothetical system that would 
have a as input and /) as output, assuming that such a system exists (in other words, 
assuming that (a < /)). Then, h{a, /)) is the supremum of all values of S{s). The 
second theorem gives a bound on delay for the general case. 

Theorem 1.4.2 (Delay Bound). Assume a flow, constrained by arrival curve a, 
traverses a system that offers a service curve of [3. The virtual delay d{f) for all t 
satisfies: d(t) < h(a,l3). 

Proof: Consider some fixed f > 0; for all r < d(f), we have, from the definition 

of virtual delay, R{t) > R*{t + r). Now the service curve property at time t + t 
implies that there is some sq such that 



R{t) > R{t -I- r - So) + P{so) 
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It follows from this latter equation that t + r — sq < t. Thus 

a{r - So) > lB(t) - B(t + r - sq)] > /3(so) 

Thusr < S(t—so) < /i(q!, /3). This is true for all r < d(t)thusd(t) < h(a,/3). □ 

Theorem 1.4.3 (Output Flow). Assume a flow, constrained by arrival curve a, 
traverses a system that offers a service curve of (3. The output flow is constrained by 
the arrival curve a* = a (Z) j3. 

The theorem uses min-plus deconvolution, introduced in Chapter 3, which we 
have already used in Theorem 1.2.2. 



Proof: With the same notation as above, consider B*{t) — B*{t — s), for 0 < 

t — s <t. Consider the definition of the service curve, applied at time t — s. Assume 
for a second that the inf in the definition of i? ® /3 is a min, that is to say, there is 
some u > 0 such that 0 < t — s — u and 



{B ® P){t — s) = B{t — s — u) + (3{u) 



Thus 

B*{t — s) — B(t — s — u) > (3{u) 

and thus 

B*{f) -B*{t-s)< B*(t) - I3(u) -B(t-s-u) 

Now B*{t) < B{t), therefore 

B*{t) — B*{t — s) < B{t) — B{t — s — u) — (3(u) < a{s + u) — (3(u) 

and the latter term is bounded by {a 0 fl){s) by definition of the 0 operator. 

Now relax the assumption that the the inf in the definition of i? 0 /3 is a min. In 
this case, the proof is essentially the same with a minor complication. For all e > 0 
there is some u > 0 such that 0 < t — s — u and 



{B 0 P){t — s) > B{t — s — u) + (3{u) — e 
and the proof continues along the same line, leading to: 

B*{f) - B*{t- s) < {a(Zl3){s)+e 

This is true for all e > 0, which proves the result. □ 



A simple Example and Interpretation of Leaky Bucket Consider a flow con- 
strained by one leaky bucket, thus with an arrival curve of the form a = 7r,(>, served 
in anode with the service curve guarantee (3r^t- The alert reader will enjoy applying 
the three bounds and finding the results shown in Figure 1.10. 

Consider in particular the case T = 0, thus a flow constrained by one leaky 
bucket served at a constant rate i?. If i? > r then the buffer required to serve the 
flow is b, otherwise, it is infinite. This gives us a common interpretation of the leaky 
bucket parameters r and b: r is the minimum rate required to serve the flow, and b is 
the buffer required to serve the flow at any constant rate > r. 
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Figure 1.10: Computation of buffer, delay and output bounds for an input flow 
constrained by one leaky bucket, served in one node offered a rate-latency 
service curve. If r < i?, then the buffer bound is x = 6 -i- rT, the delay bound 
is d = T -I- ^ and the burstiness of the flow is increased by rT. U r > R, the 
bounds are infinite. 

Example: VBR flow with rate-latency service curve Consider a VBR flow, de- 
fined by TSPEC (M, p, r, b). This means that the flow has a{t) = min(M +pt, rt + 
b) as an arrival curve (Section 1.2). Assume that the flow is served in one node that 
guarantees a service curve equal to the rate-latency function (3 = This exam- 

ple is the standard model used in Intserv. Let us apply Theorems 1.4.1 and 1.4.2. 
Assume that R> r, that is, the reserved rate is as large as the sustainable rate of the 
flow. 

From the convexity of the region between a and /3 (Figure 1.4.1), we see that 
the vertical deviation v = sup^>q[q;(s) — /3(s)] is reached for at an angular point of 
either a or (3. Thus 



with 0 = . Similarly, the horizontal distance is reached an angular point. In the 

figure, it is either the distance markes as AA' or BB' . Thus, the bound on delay d 
is given by 



After some max-plus algebra, we can re-arrange these results as follows. 

Proposition 1.4.1 (Intserv model, buffer and delay bounds). Consider a VBR 
flow, with TSPEC {M,p, r, b), served in a node that guarantees to the flow a service 
curve equal to the rate-latency function (3 = (3r^t- The buffer required for the flow 
is bounded by 



V = max[o;(T), a{0)\ 




V = b r max 




The maximum delay for the flow is bounded by 



d = 
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time 



Figure 1.11: Computation of buffer and delay bound for one VBR flow served 
in one Intserv node. 

We can also apply Theorem 1.4.3 and find an arrival curve a* for the output 
flow. We have a* = a (d {Xr ® St) = {a (d Xr) 0 St from the properties of 0 
(Chapter 3). Note that 

ifOST)(t)=f(t + T) 

for all / (shift to the left). 

The computation of a 0 Xr is explained in Theorem 3.1.14 on Page 152: it 
consists in inverting time, and smoothing. Here, we give however a direct derivation, 
which is possible since a is concave. Indeed, for a concave a, define f o as 

to = inf{f > 0 : o:'(f) < R} 

where a' is the left-derivative, and assume that to < -l-oo. A concave function 
always has a left-derivative, except mayhe at the ends of the interval where it is 
defined. Then by sfudying the variations of the function u — )• a{t + u) — Ru we 
hnd that {a 0 Xr){s) = a{s) if s > to, and {a 0 Xr){s) = a{to) + {s — to)R if 
s < to- 




Figure 1.12: Derivation of arrival curve for the output of a flow served In a 
node with rate-latency service curve f3R,T- 



Putting the pieces all together we see that the output function a * is obtained 
from a by 
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• replacing a on [0,io] by the linear function with slope R that has the same 
value as a for t = to, keeping the same values as a on [fo, +oo[, 

• and shifting by T to the left. 

Figure 1.12 illustrates the operation. Note that the two operations can be performed 
in any order since ® is commutative. Check that the operation is equivalent to the 
construction in Theorem 3.1.14 on Page 152. 

If we apply this to a VBR connection, we obtain the following result. 

Proposition 1.4.2 (Intserv model, output bound). With the same assumption as 
in Proposition 1.4.1, the output flow has an arrival curve a * given by: 

j if ^ <Tthena*{t) = b + r{T + t) 

I else a*{t) = min | {t + T){p A R) + M + — R)~^ ,b + r{T + f)| 

An ATM Example Consider the example illustrated in Figure 1.13. The aggregate 
flow has as an arrival curve equal to the stair function 10u25,4- The figure illustrates 
that the required buffer is 18 ATM cells and the maximum delay is 10 time slots. We 




Figure 1.13: Computation of bounds for buffer x and delay d for an ATM ex- 
ample. An ATM node serves 10 ATM connections, each constrained with 
GCRA(25,4) (counted in time slots). The node offers to the aggregate flow 
a service curve with rate R = I cell per time slot and latency T = 8 
time slots. The figure shows that approximating the stair function 10r;25,4 by 
an affine function results into an overestimation of the bounds. 



know from Corollary 1 .2. 1 that a GCRA constraint is equivalent to a leaky bucket. 
Thus, each of the 10 connections is constrained by an affine arrival curve 7^,6 with 
r = ^ = 0.04 and fo=l-|-^ = 1.16. However, if we take as an arrival curve 
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for the aggregate flow the resulting affine function then we obtain a buffer 

bound of 11.6 and a delay bound of 19.6. The affine function overestimates the 
buffer and delay bounds. Remember that the equivalence between stair function and 
affine function is only for a flow where the packet size is equal to the value of the 
step, which is clearly not the case for an aggregate of several ATM connections. 

A direct application of Theorem 1 .4.3 shows that an arrival curve for the output 
flow is given by Q!o(f) = a{t + T) = V 25 , 12 {t). 

In Chapter 2, we give a slight improvement to the bounds if we know that the 
service curve is a strict service curve. 

1.4.2 Are the Bounds Tight ? 

We now examine how good the three bounds are. For the backlog and delay bounds, 
the answer is simple: 

Theorem 1.4.4. Consider the backlog and delay bounds in Theorems 1.4.1 and 
1.4.2. Assume that 

• a is a “good” function ( that is, namely, is wide-sense increasing, sub-additive 
andaiU) = OJ 

• (3 is wide-sense increasing and /3(0) =0 

Then the bounds are tight. More precisely, there is one causal system with input flow 
R{t) and output flow R*{t), such that the input is constrained by a, offering to the 
flow a service curve fl, and which achieves both bounds. 

A causal system means that R{t) < R* (t). The theorem means that the backlog 
bound in Theorem 1.4.1 is equal to supj>o[J?(f) — R*{t)], and the delay bound in 
Theorem 1.4.1 is equal to supj>o d{t). In the above, d{t) is the virtual delay defined 
in Definition 1.1.1. 

Proof: We build one such system R,R* by defining R = a, R* = min(a:,/3). 

The system is causal because R* < a = R. Now consider some arbitrary time t. If 
a(t) < (3{f) then 

R*{f) = R(t) = R(t) 4- 13(0) 

Otherwise, 

R*(t) =p(t) =i?(0)+/3(f) 

In all cases, for all t there is some s <t such that R*(t) > R(t — s) + /3(s), which 
shows the service curve property. □ 

Of course, the bounds are as tight as the arrival and service curves are. We have 
seen that a source such that R(t) = a(t) is called greedy. Thus, the backlog and 
delay bounds are worst-case bounds that are achieved for greedy sources. 

In practice, the output bound is also a worst-case bound, even though the detailed 
result is somehow less elegant. 
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Theorem 1.4.5. Assume that 

1. a is a “good” function (that is, is wide-sense increasing, sub-additive and 

Q!(0) = Oj 

2. a is left-continuous 

3. (3 is wide-sense increasing and /3(0) = 0 

4. a(Z)Oi is not bounded from above. 

Then the output bound in Theorem 1.4.3 is tight. More precisely, there is one causal 
system with input flow R(t) and output flow R*(t), such that the input is constrained 
by a, offering to the flow a service curve (3, and a* (given by Theorem 1.4.3) is the 
minimum arrival curve for R*. 

We know in particular from Section 1.2 that the first three conditions are not 
restrictive. Let us first discuss the meaning of the last condition. By definition of 
max-plus deconvolution: 

{a(da){t) = inf {a;(f + s) — Q;(s)} 

5>0 

One interpretation of a0a is as follows. Consider a greedy source, with R(t) = 
a{t)’, then (a0a){t) is the minimum number of bits arriving over an interval of du- 
ration t. Given that the function is wide-sense increasing, the last condition means 
that limj_j._|_oo(Q:0ci:)(i) = +oo. For example, for a VBR source with T-SPEC 
(p,M,r,b) (Figure 1.5), we have (a<Z)a){t) = rt and the condition is satisfied. 
The alert reader will easily be convinced that the condition is also true if the arrival 
curve is a stair function. 

The proof of Theorem 1.4.5 is a little technical and is left at the end of this 
chapter. 

We might wonder whether the output bound a * is a “good” function. The answer 
is no, since a* (0) is the backlog bound and is positive in reasonable cases. However, 
a* is sub-additive (the proof is easy and left to the reader) thus the modified function 
(5o A a* defined as a* (t) for f > 0 and 0 otherwise is a “good” function. If a is left- 
continuous, (5o A a* is even a “very good” function since we know from the proof 
of Theorem 1.4.5 that it is left-continuous. 

1.4.3 Concatenation 

So far we have considered elementary network parts. We now come to the main 
result used in the concatenation of network elements. 

Theorem 1.4.6 (Concatenation of Nodes). Assume a flow traverses systems iSi 
and 1 S 2 in sequence. Assume that Si offers a service curve of ff, i = 1,2 to the flow. 
Then the concatenation of the two systems offers a service curve of (3 \ ® (32 to the 
flow. 
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Proof: Call Ri the output of node 1 , which is also the input to node 2. The service 

curve property at node 1 gives 



Ri > R^/3i 



and at node 2 

i?* > i?i ® /32 > (i? ® /3i) ® /32 = ® iPi ® P 2 ) 

□ 

Examples: Consider two nodes offering each a rate-latency service curve (3 

i = 1, 2, as is commonly assumed with Intserv. A simple computation gives 

Thus concatenating Intserv nodes amounts to adding the latency components and 
taking the minimum of the rates. 

We are now also able to give another interpretation of the rate-latency service 
curve model. We know that (Ir^t = (<^t ® Aij)(f); thus we can view a node offering 
a rate-latency service curve as the concatenation of a guaranteed delay node, with 
delay T and a constant bit rate or GPS node with rate R. 

Pay Bursts Only Once The concatenation theorem allows us to understand a phe- 
nomenon known as “Pay Bursts Only Once”. Consider the concatenation of two 
nodes offering each a rate-latency service curve PR.^Ti, i = 1, 2, as is commonly 
assumed with Intserv. Assume the fresh input is constrained by 'Yr,b- Assume that 
r < Ri and r < R2. We are interested in the delay bound, which we know is a 
worst case. Let us compare the results obtained as follows. 

1 . by applying the network service curve; 

2. by iterative application of the individual bounds on every node 
The delay bound Dq can be computed by applying Theorem 1.4.2: 

Do = — + To 

with R = min,(i?j) and Tq = T, as seen above. 

Now apply the second method. A bound on the delay at node 1 is (Theorem 
1.4.2): 

The output of the first node is constrained by a * , given by : 



a*{t) = b + r X {t + Ti) 
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A bound on the delay at the second buffer is: 



D2 



h + rTi 

i?2 



+ T2 



And thus 



^ ^ b b + rT\ 

Di + D2 = — H h To 

J~l\ JX2 



It is easy to see that Dq < D\ + T> 2 . In other words, the bounds obtained by con- 
sidering the global service curve are better than the bounds obtained by considering 
every buffer in isolation. 

Let us continue the comparison more closely. The delay through one node has 
the form -^ + T\ (for the hrst node). The element ^ is interpreted as the part 
of the delay due to the burstiness of the input flow, whereas T\ is due to the delay 
component of the node. We see that Di + D 2 contains twice an element of the form 
whereas Do contains it only once. We sometimes say that “we pay hursts only 
once”. Another difference between Do and T>i -|- D 2 is the element it is due to 
the increase of burstiness imposed by node 1. We see that this increase of burstiness 
does not result into an increase of the overall delay. 

A corollary of Theorem 1.4.6 is also that the end-to-end delay bound does not 
depend on the order in which nodes are concatenated. 



1.4.4 Improvement of Backlog Bounds 

We give two cases where we can slightly improve the backlog bounds. 

Theorem 1.4.7. Assume that a lossless node ojfers a strict service curve (3 to a 
flow with arrival curve a. Assume that a{uo) < l3{uo) for some uo > 0. Then 
the duration of the busy period is < Uo- Furthermore, for any time t, the backlog 
R{t) — R*{t) satisfies 

R(t) — R*(t) < sup [T(f) — R(t — u)— (3{u)\ < sup [q;(u) — (3{u)\ 

u:0<u<uq u:0<u<uq 

The theorem says that, for the computation of a buffer bound, it is sufficient to 
consider time intervals less than uo- The idea is that the busy period duration is less 
than Uo- 



Proof: Consider a given time t at which the buffer is not empty, and call s the last 

time instant before t at which the buffer was empty. Then, from the strict service 
curve property, we have 

R*{f) > R*{s) + l3{t — s) = x(s) + l3{t — s) 

Thus the buffer size b{t) = R{t) — R*{t) at time t satishes 

b(t) < R(t) — R(s) — (3{t — s) < a(t — s) — (3{t — s) 
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Now if t — s > uo, then there is a time t' = s + uq, with s + 1 < t' < t such 
that b{t') = 0. This contradicts the definition of s. Thus we can assume that t — s < 
Uq. □ 

Theorem 1.4.8. Assume that a lossless node offers a service curve (3 to a flow with 
sub-additive arrival curve a. Assume that (3 is super-additive, and that a{uo) < 
f3(uff) for some uq > 0. Then for any time t, the backlog R(t) — R*(t) satisfies 

R(t) — R*(t) < sup [7?(f) — R(t — u)— (3{u)\ < sup [q!(u) — (3{u)\ 

u:0<u<uq u:0<u<uq 

Note that the condition that a is super- additive is not a restriction. In contrast, 
the condition that (3 is super-additive is a restriction. It applies in particular to rate- 
latency service curves. The theorem does not say anything about the duration of the 
busy period, which is consistent with the fact we do not assume here that the service 
curve is strict. 

Proof: For an arbitrary time t the backlog at time t satisfies 

h{f) < sup — R{t — u)— (3{u)\ 

ii>0 

For ,s <t define k = \ and s' = kuo + s. We have s < s' <t and 

t — Uq < s' (1-11) 

Now from the super-additivity of (3: 

R{t) - R{s) < [R{t) - R{s') - I3{t - s')] + [R{s') ~ R{s) - l3{s' - s)j 
Note that for the second part we have 

R{s') — R{s) — j3{s' — s) < k [q!(uo) — l3{uo)] < 0 



thus 

R(t) - R(s) < [R(t) - R(s') - (3{t - s')] 
which shows the theorem. □ 



1.5 Greedy Shapers 

1.5.1 Definitions 

We have seen with the definition of the leaky bucket and of the GCRA two examples 
of devices that enforce a general arrival curve. We call po/icerwith curve a a device 
that counts the bits arriving on an input flow and decides which bits conform with an 
arrival curve of a. We call shaper, with shaping curves, a bit processing device that 
forces its output to have a as an arrival curve. We call greedy shaper a shaper that 
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delays the input bits in a buffer, whenever sending a bit would violate the constraint 
a, but outputs them as soon as possible. 

With ATM and sometimes with Intserv, traffic sent over one connection, or flow, 
is policed at the network boundary. Policing is performed in order to guarantee that 
users do not send more than specified by the contract of the connection. Traffic 
in excess is either discarded, or marked with a low priority for loss in the case of 
ATM, or passed as best effort traffic in the case of Intserv. In the latter case, with 
IPv4, there is no marking mechanism, so it is necessary for each router along the 
path of the flow to perform the policing function again. 

Policing devices inside the network are normally buffered, they are thus shapers. 
Shaping is also often needed because the output of a buffer normally does not con- 
form any more with the traffic contract specified at the input. 

1.5.2 Input-Output Characterization of Greedy Shapers 

The main result with greedy shapers is the following. 

Theorem 1.5.1 (Input-Output Characterization of Greedy Shapers). Consider 
a greedy shaper with shaping curve a. Assume that the shaper buffer is empty at 
time 0, and that it is is large enough so that there is no data loss. For an input flow 
R, the output R* is given by 

R*=R^d (1.12) 

where a is the sub-additive closure of a. 

Proof: Remember first that if a is sub-additive and (t( 0) = 0, then d = <t. In 

general, we know that we can replace ahy a without changing the definition of the 
shaper. We thus assume without loss of generality that a = a. 

The proof of the theorem is an application of min-plus algebra. First, let us 
consider a virtual system that would take R as input and have an output S satisfying 
the constraints: 

{ ^ - ^ (1 13) 

Such a system would behave as a buffer (the first equation says that the output 
is derived from the input) and its output would satisfy the arrival curve constraint 
(T. However, such a system is not necessarily a greedy shaper; we could have for 
example a lazy shaper with S{t) = 0 for all f > 0 ! For this system to be a greedy 
shaper, it has to output the bits as soon as possible. Now there is a general result 
about systems satisfying conditions 1.13. 

Lemma 1.5.1 (A min-plus linear system). Assume that a is a “good” function 
(that is, is sub-additive and <t(0) =0). Among all functions S(t) satisfying condi- 
tions 1.13 for some fixed function R, there is one that is an upper bound for all. It is 
equal to R® a 
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Proof of the lemma: The lemma is a special case of a general result in Chapter 4. 

However, it is also possible to give a very simple proof, as follows. 

Define S* = R^a. Since cr is a “good” function, it follows immediately that S'* 
is a solution to System (1.13). Now, let S' be some other solution. We have S' < R 
and thus 

S' < So a = S* 

Therefore S* is the maximal solution. □ 

Note that the lemma proves the existence of a maximal solution to System (1.13). 
Note also that, in the lemma, function R need not be wide-sense increasing. 

Now we can use the lemma by showing that R* = S* . Function R is wide- 
sense increasing, thus so is S*. Obviously, R* is a solution of System (1.13), thus 
R*{t) < S*{t) for all t. Now if there would be some t such that R*{t) ^ S*{t), 
then this would contradict the condition that the greedy shaper attempts to send the 
bits out as early as possible. □ 

The following corollary derives immediately. 

Corollary 1.5.1 (Service Curve offered by a Greedy Shaper). Consider a greedy 
shaper with shaping curve a. Assume that a is sub-additive and (t(0) = 0. This 
system offers to the flow a service curve equal to a. 



fresh traffic 



shaper 



Pi Pi 




R R* 



Figure 1.14: Reshaping example. 

Example: Buffer Sizing at a Re-shaper Re-shaping is often introduced because 
the output of a buffer normally does not conform any more with the traffic contract 
specified at the input. For example, consider a flow with the arrival curve (j(t) = 
min{pt -h M, rt -h b) that traverses a sequence of nodes, which offer a service curve 
= Pr,t- a greedy shaper, with shaping curve a, is placed after the sequence of 
nodes (Figure 1.14). The input to the shaper (R in the hgure) has an arrival curve 
a* , given by Proposition 1.4.2. Corollary 1.5.1 gives a service curve property for 
the greedy shaper, thus the buffer B required at the greedy shaper is the vertical 
distance v{a* , a). After some algebra, we obtain: 

if ^ < T then b5-Tr 

if^>Tandp>i? then M 4 - (b-MKp-R) ^ 

p—r — ^ p—r 

else M Tp 



B = 
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Corollary 1.5.2 (Buffer Occupancy at a Greedy Shaper). Consider a greedy 
shaper with shaping curve a. Assume that a is sub-additive and <t(0) = 0. Call 
R{t) the input function. The buffer occupancy x{t) at time t is given by 

x(t) = sup {R(t) — R(s) — a(t — s)} 

0<s<t 



Proof: The backlog is defined by x{t) = R{t) — R*{t), where R* is the output. 

We apply Theorem 1.5.1 and get: 

x(t) = R(t) — inf {R(s) + a(t — s)} = R(t) + sup {—R(s) — a(t — s)} 

0<s<t 0<s<t 

□ 

Note that Lemma 1.2.2 is a special case of this corollary. 

In min-plus algebraic terms, we say that a system is linear and time invariant 
if its input-output characterization has the form R* = R® (i (where /) is not nec- 
essarily sub-additive). We can thus say from the theorem that greedy shapers are 
min-plus linear and time invariant systems. There are min-plus linear and time in- 
variant system that are not greedy shapers. For example, a node imposing a constant 
delay T is characterized by the input-output relationship 

R* =R^ St 

Compare to the guaranteed delay node (namely, a node imposing a variable delay 
bounded by T), for which the input-output relationship is a service curve property : 

R* ~> R ^ St 

The rest of this Section illustrates similarly that the input-output characterization 
of greedy shapers R* = i? ® <t is much stronger than the service curve property 
described in Corollary 1.5.1. 

1.5.3 Properties of Greedy Shapers 

Consider again Figure 1.14. We have seen in the previous section how we can com- 
pute the buffer size required at the greedy shaper. Now if greedy shapers are intro- 
duced along a path, then some bits may be delayed at the shaper, thus the end-to-end 
delay might increase. However, this is not true, as the following results state that, 
from a global viewpoint, “greedy shapers come for free”. 

Theorem 1.5.2 (Re-Shaping does not increase delay or buffer requirements). 

Assume a flow, constrained by arrival curve a, is input to networks iSi and S 2 in 
sequence. Assume a greedy shaper, with curve a > a is inserted between S\ and S 2 . 
Then the backlog and delay bounds given by Theorem 1.4.2 for the system without 
shaper are also valid for the system with shaper. 

The condition a > a means that re-shaping mayhe only partial. 
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Proof: Call /3, the service curve of Si. The backlog bound in Theorem 1.4.1 is 

given by 

® (T ® /32) = (T ® /3i ® /32) (T15) 

Now the last expression is the backlog bound obtained if we put the shaper imme- 
diately at the entrance of the network. Clearly, this introduces no backlog, which 
shows that the overall backlog is not influenced by the shaper. The same reasoning 
applies to the delay bound. □ 

If you read carefully, you should not agree with the last paragraph. Indeed, there 
is a subtlety. The bounds in Section 1.4 are tight, but since we are using several 
bounds together, there is no guarantee that the resulting bound is tight. All we can 
say at this point is that the bound computed for the system with shaper is the same 
if we put the shaper in front; we still need to show that the bound for such a system 
is the same bound as if there would be no shaper. This can be proven in a number 
of ways. We give here a computational one. The proof relies on Lemma 1 .5.2, given 
below. □ 

Lemma 1.5.2. Let a and a be “good” functions. Assume a < a. Then for any 
function /3, v(a, a ® (3) = v(a, /3) and h(a, a ® ff) = h(a, /3). 

Proof: We use the reduction to min-plus deconvolution explained in Section 3.1.11 

We have: 

v{a,(T ^ P) = [a 0 (<T ® /3)] (0) 

Now from Theorem 3.1.12 on Page 148: a 0 (<t 0 /3) = (a 0 <t) 0 /3. Also, since 
(T > a, we have a <Z) a < a <Z) a. Now a<Z) a = a because a is a “good” function, 
thus 

0:0 (<T 0/3) =0:0/3 (L16) 

and finally r;(o, <t 0 /3) = v{a, /3). 

Similarly h{a, 0) = inf{(3 such that (o 0 /3)(— d) < 0} which, combined with 
Equation (1.16) proves that /i(o, (T 0 /3) =h{a,P). □ 

Consider again Figure 1.14. Assume that the first network element and the 
greedy shaper are placed in the same node. Theorem 1.5.2 says that the total buffer 
required for this combined node is the same as if there would be no greedy shaper at 
the output. Thus, if you can dynamically allocate buffer space from a common pool 
to the first network element and the greedy shaper, then the greedy shaper costs no 
memory. However, the greedy shaper does need some buffer space, as given in Equa- 
tion (1.14). Similarly, the theorem says that there is no penalty for the worst-case 
delay. 

In contrast, placing a greedy shaper has an obvious benefit. The burstiness of the 
flow admitted in the next network element is reduced, which also reduces the buffer 
required in that element. To be more concrete, consider the example “Pay Bursts 
Only Once” in Section 1.4.3. Assume that a re-shaper is introduced at the output 
of the first node. Then the input to the second node has the same arrival curve as 
the fresh input, namely, yj.,b instead of 'Yr,b+rTi ■ The buffer required for the flow at 
node 2 is then b + rT 2 instead of 6 -|- r(Ti -I- T 2 ). 
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The following result is another “physical” property of greedy shapers. It says 
that shaping cannot be undone by shaping. 

Theorem 1.5.3 (Shaping Conserves Arrival Constraints). Assume a flow with 
arrival curve a is input to a greedy shaper with shaping curve a. Assume a is a 
“good” function. Then the output flow is still constrained by the original arrival 
curve a. 



Proof: 



R* = R® a < {R® a) ® a 



since the condition R < R® a expresses that a is an arrival curve. Thus 



R* <R^a^a = R*^a 



□ 

The output of the greedy shaper has thus min(a!, a) as an arrival curve. If a 
is also a “good” function, we know (Lemma 1.2.5) that the sub-additive closure of 
min(ci!, (t) is a ® cr. 

Example (ATM Multiplexer): Consider an ATM switch that receives 3 ATM 

connections, each constrained by GCRA(10, 0) (periodic connections). The switch 
serves the connection in any work conserving manner and outputs them on a link 
with rate 1 cell per time slot. What is a good arrival curve for the aggregate output ? 

The aggregate input has an arrival curve a = 3vio,o- Now the server is a greedy 
shaper with shaping curve cr = vi^o, thus it keeps arrival constraints. Thus the output 
is constrained by 3?;io,o ® which is a “good” function. We have already met 
this example in Figure 1.6. 

1.6 Maximum Service Curve, Variable and Fixed De- 
lay 

1.6.1 Maximum Service Curves 

If we modify the sense of the inequation in the definition of service curve in Sec- 
tion 1.3, then we obtain a new concept, called maximum service curve, which is 
useful to ( 1 ) account for constant delays and ( 2 ) in some cases to establish a rela- 
tionship between delay and backlog. 

Definition 1.6.1 (Maximum Service Curve). Consider a system S and a flow 
through S with input and output function R and R*. We say that S offers to the 
flow a maximum service curve 7 if and only ifjGlF and R* < i? ® 7 

Note that the definition is equivalent to saying that 7 is wide-sense increasing 
and that 
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R*{t) < R{s) + 7 (i — s) 
for all t and all s < f, or equivalently 

R*{t) - R*{s) < B{s) + -fit - s) 

where B{s) is the backlog at time s. A greedy shaper with shaping curve a offers a 
both as a service curve and a maximum service curve. 

In general, the concept of maximum service curve is not as powerful as the 
concept of service curve. However, as we see below, it can be useful to account for 
maximum rates and for constant propagation delays. We also see in Chapter 6 that 
it allows us to find good bounds for aggregate multiplexing. 

The following propositions give two special cases of interest. Their proof is easy 
and left to the reader. 

Proposition 1.6.1 (Minimum Delay). A lossless node offers a maximum service 
curve equal to St if and only if it imposes a minimum virtual delay equal to T. 

Proposition 1.6.2 (Arrival Constraint on Output). Assume the output of a lossless 
node is constrained by some arrival curve a. Then the node offers a as a maximum 
service curve. 

Like minimum service curves, maximum service curves can be concatenated: 

Theorem 1.6.1 (Concatenation of Nodes). Assume a flow traverses systems iSi and 
1 S 2 in sequence. Assume that Si offers a maximum service curve of ^i, i = 1,2 to 
the flow. Then the concatenation of the two systems offers a service curve of^\ ® 72 
to the flow. 



Proof: The proof mimics the proof of Theorem 1 .4.6 



□ 



Application: Consider a node with a maximum output rate equal to c and with 

internal propagation delay equal to T. It follows from Theorem 1.6.1 and the two 
previous propositions that this node offers to any flow a maximum service curve 
equal to the rate-latency function iSc.rit) = [c(t — T)]"*". 

Maximum service curves do not allow us to derive as strong results as (ordinary) 
service curves. However, they can be used to reduce the output bound and, in some 
cases, to obtain a minimum delay bound. Indeed, we have the following two results. 

Theorem 1.6.2 (Output Flow, generalization of Theorem 1.4.3 ). Assume a flow, 
constrained by arrival curve a, traverses a system that offers a service curve (3 and 
a maximum service curve 7 . The output flow is constrained by the arrival curve 
a* = (a ® 7 ) 0 /). 
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Proof: Instead of a computational proof as with Theorem 1.4.3, it is simpler at 

this stage to use min-plus algebra. Call R and R* the input and output functions, 
and consider R* <Z) R*, the minimum arrival curve for R* . We have R* < i? ® 7 
and R* > R® (3, thus 



i?* 0 i?* < (i? ® 7) 0 (i? ® ( 3 ) 

From Rule 12 in Chapter 3, Theorem 3.1.12, applied to/ = i?0 7, p = i? and 
h = /3, we derive 

R* (Z)R* < {{R 0 7) 0 i?} 0 /3 

Now from the commutativity of 0 and from Rule 13 in Theorem 3.1.12: 

{{R 0 7) 0 i?} = {(7 0 i?) 0 i?} < {7 0 (i? 0 R)} 

Thus 

R* <Z)R* < {7 0 (i? 0 i?)} 0 /3 < (7 0 a) 0 /3 



□ 



Theorem 1.6.3 (Minimum Delay Bound). Assume a flow, constrained by arrival 
curve a, traverses a system that ojfers a maximum service curve ofy. Assume that 
y{D) = 0. The virtual delay d{t) satisfies d{t) > D for all t. 



Proof: We have R*{t) < R{t - D) + y{D) thus R* (t) < R{t - D) □ 

Note that the output bound is improved by the knowledge of the maximum ser- 
vice curve since in general we expect a 0 7 to be less than a. In contrast, the 
minimum delay bound gives some new information only in the cases where there is 
a latency part in the maximum service curve, which is the case for the first example 
(Minimum Delay ), but not in general for the second example (Arrival Constraint 
on Output). 



Numerical Example: Consider again the example illustrated in Figure 1.13. Let 

us first apply Theorem 1 .4.3 and compute an arrival curve a q for the output. The 
details are as follows. We have 

Q:q = 10u25,4 0 j3l,% = 10u25,4 0 (Al 0 dg) 

Now from Rule 15 in Chapter 3, we have 

= (10u25,4 0 (^s) 0 Al 

Now (10u25,4 0 Ss){t) = 10^25, 4 (f + 8) = 10^25, i 2 (i). and a straightforward 
application of the definition of 0 shows that finally q:q = ^25,12- 

Assume now that we have more information about the node, and that we can 
model is as node Ni dehned as the concatenation of two schedulers and a hxed 
delay element (Figure 1.15). Each scheduler offers to the aggregate flow a service 
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curve /3i?o,To with rate Rq = 1 cell per time slot and latency Tq = 2 time slots. The 
delay element is a link with maximum rate equal to 1 cell per time slot, and a fixed 
propagation and transmission delay equal to 4 time slots. The delay element is thus 
the combination of a greedy shaper with shaping curve A i (f) = t and a fixed delay 
element <^ 4 . We can verify that the concatenation of the three elements in node 1 
offers a service curve equal to /3i 2 ® Ai ® <^4 ® Pi , 2 = Pi,s- Now, from the delay 
element allows us to say that, in addition, the node also offers to the aggregate flow a 
maximum service curve equal to Pi, 4 . We can apply Theorem 1.6.2 and derive from 
that the output is constrained by the arrival curve a j given by 

Q!i = (a(^Pi,4) 0Pl,8 

The computation is similar to that of ag and involves the computation of 10u25,4 0 
Ai, which is similar to the example illustrated in Figure 1.6. Finally, we have: 

0^1 (i) = (10^25,4 ® Ai)(f + 4) 

Figure 1.15 shows that aj is a better bound than the arrival curve that we would 
obtain if we did not know the maximum service curve property. 

Assume next that we change the order of the delay element in node 51 and place 
it as the last element of the node. Call S 2 the resulting node. Then the conclusion of 
the previous paragraph remains, since the bounds are insensitive to the order, due to 
the commutativity of min-plus convolution. Thus the output of system S 2 also has 
al as an arrival curve. However, in that case, we can also model the delay element 
as the combination of a shaper, with shaping curve A 1 (corresponding to a fixed rate 
of 1 cell per time slot), followed by a fixed delay element, with constant delay equal 
to 4 time slots. The input to the shaper has an arrival curve equal to a 0 /3 1 , 4 , where 
a = 10u25,4 is the fresh arrival curve. Thus, from the properties of shapers, the 
output of the shaper is constrained by 

0:2 = (q; 0 Pi,4) 0 Ai = 10 u 25,8 0 Ai 

Since the fixed delay component does not alter the flow, the output of system S 2 has 
a 2 as an arrival curve. Figure 1.15 shows that a 2 is a better bound than a^. 

This fact is true in general: whenever a network element can be modeled as a 
shaper, then this model provides stronger bounds than the maximum service. 

1.6.2 Delay from Backlog 

In general it is not possible to bound delay from backlog with the framework of 
service curves, except in one particular but important case. 

Theorem 1.6.4. Assume a lossless node offers to a flow a minimum service curve P 
and a maximum service curve 7 , such that P{t) = 'y{t — v). Let f be the max-plus 
deconvolution 707 , that is. 
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Figure 1.15: Use of maximum service curve to improve output bound. The 
figure is for the same example as Figure 1.15. Top: nodes 5i and 52, two 
possible implementations of a system offering the overall service curve /3i,g. 
Middle: arrival curve a and overall service curve /3i,g. Bottom: constraint for 
the output. al (top curve, thick, plain line) is obtained with the only knowl- 
edge that the service curve is /3i,g. al (middle curve, thick, dashed line) is 
obtained assuming the system is 5i. (bottom curve, thin, plain line) is 
obtained assuming the system is 52. 
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Then the backlog B (t) and the virtual delay d{t) satisfy 

f{d{f) -v) < B{f) 

If in addition 7 is super-additive, then 

p{d{t)) < B(t) 



Proof: Fix some i > 0; we haved(i) = inf Et where the set Et is defined by 

Ef = {s >Q R* (t s) > R{t)} 

Since R* and R are wide-sense increasing, Ef is an interval. Thus 
d{t) = sup{s > 0 : i?*(t + s) < R{t)} 

We assume that R and R* are left-continuous. It follows that 

R*{t-\-d{t)) < R{t) 

For some arbitrary e, we can find some s such that 

R*{t d(t)) > R{s) P{t — s -I- d{t)) — e 
Now from the maximum service curve property 

R*{t)-R{s) < 7 (i-s) 



Combining the three gives 

B(t) = R(t)—R*(t) > l3(t—s-\-d(t))—j(t—s)—e = ^{t—s-\-d{f)—v)—^{t—s)—e 



and thus 

B{f) > inf[ 7 ((i(f) -v-\-u) — j(u)] (1-17) 

n>0 

From the definition of /, the latter term is f{d{t) — v). Finally, if 7 is super- additive, 
then 707 = 7 □ 

We can apply the theorem to a practical case: 

Corollary 1.6.1. Assume a lossless node offers to a flow a minimum service curve 
P = Pr,v t^nd a maximum service curve 7 = l3r,v'> with v' < v. The backlog B{f) 
and the virtual delay d{f) satisfy 



d{f) < 



m 



■L V 
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Proof: We apply the theorem and note that 7 is super-additive, because it is con- 
vex. □ 

1.6.3 Variable versus Fixed Delay 

Some network elements impose fixed delays (propagation and transmission), whereas 
some other network elements impose variable delays (queueing). In a number of 
cases, it is important to evaluate separately the total delay and the variable part of 
the delay. The total delay is important, for example, for determining throughput and 
response time; the variable part is important for dimensioning playout buffers (see 
Section 1.1.3 for a simple example, and chapter 5 for a more general discussion). We 
have seen at the end of end of Section 1.5.2 that a node imposing a constant delay 
can be modeled as a min-plus linear system. Beyond this, the concept of maximum 
service curve is a tool for telling apart variable delay from fixed delay, as follows. 

Consider a network, made of a series of network elements each element 

being the combination of a fixed delay d, and a variable delay. Assume fhe variable 
delay componenf offers a service curve /3j. A fixed delay component offers 64 ^ both 
as a service curve and as a maximum service curve. Define /) = /3i ® ... ® /3/; 
fhe network offers as end-to-end service curve /) ® and as end-to-end 

maximum service curve Assume the input flow is constrained by some 

arrival curve a; from Theorems 1.4.2 and 1.6.3, the end-to-delay d{t) satisfies 

di + ... + di < d(t) < h(a, j3 ® <5di+...+dJ 

By simple inspecfion, h{a,(3 ® <5di+...+dj) = d\ + ... + dj + h{a,P), fhus fhe 
end-to-end delay satisfies 

0 < d(t) — [di -I- ... -I- di] < h(a, (3) 

In fhe formula, d\ + ... + di fhe fixed part of the delay, and h{a, /)) is the variable 
part. Thus, for the computation of the variable part of the delay, we can simply 
ignore fixed delay components. 

Similarly, an arrival curve constraint for the output is 

a* = (a ® Sd,+...+di) 0 (/3 ® Sd,+...+di) = a(2)(3 

thus the fixed delay can be ignored for the computation of the output bound. 

For the determination of backlog, the alert reader can easily be convinced that 
fixed delays cannot be ignored. In summary: 

Proposition 1.6.3. 1. For the computation of backlog and fixed delay bounds, 

fixed or variable delay are modeled by introducing St functions in the ser- 
vice curves. As a consequence of the commutativity of 0, such delays can be 
inserted in any order along a sequence of buffers, without altering the delay 
bounds. 

2. For the computation of variable delay bounds, or for an arrival constraint on 
the output, fixed delays can be ignored. 
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1.7 Handling Variable Length Packets 

All results in this chapter apply directly to ATM systems, using discrete time mod- 
els. In contrast, for variable length packets (as is usually the case with IP services), 
there are additional subtleties, which we now study in detail. The main parts in this 
section is the definition of a packetizer, and a study of its effect on delay, bursti- 
ness and backlog bounds. We also revisit the notion of shaper in a variable length 
context. For the rest of this section, time is continuous. 

Throughout the section, we will consider some wide sense increasing sequences 
of packet arrival times T, > 0. We assume that for all t the set : T, < t} is finite. 

1.7.1 An Example of Irregularity Introduced by Variable Length 

Packets 

The problem comes from the fact that real packet switching systems normally output 
entire packets, rather than a continuous data flow. Consider the example illustrated 
in Figure 1.16. It shows the output of a constant bit rate trunk, with rate c, that 
receives as input a sequence of packets, of different sizes. Call the size (in 
bits) and the arrival epoch for the ith packet, i = 1, 2, .... The input function is 

R(t) = J2liMTi<t} ( 1 - 18 ) 

i 

In the formula, we used the indicator function 1 which is equal to 1 if expr is 
true, and 0 otherwise. 

We assume, as is usual in most systems, that we observe only entire packets 
delivered by the trunk. This is shown as R'{t) in the figure, which results from the 
bit-by-bit output i?* by a packetization operation. The bit-by-bit output R* is well 
understood; we know from Section 1.5 that R* = i? ® A^. However, what is the 
effect of packetization ? Do the results in Sections 1.4 and 1.5 still hold ? 

Certainly, we should expect some modifications. For example, the bit-by-bit out- 
put R* in the figure is the output of a greedy shaper with curve A^, thus it has A^ as 
an arrival curve, but this is certainly not true for R' . Worse, we know that a greedy 
shaper keeps arrival constraints, thus if R is cr-smooth for some a, then so is i?*. 
However, this is not true for i?'. Consider the following example (which is originally 
from [29]). Assume that a{t) = (max + rt with r < c. Assume that the input flow 
R{t) sends a first packet of size li = (max at time Ti = 0, and a second packet of 
size (2 at time T 2 = y. Thus the flow R is indeed cr-smooth. The departure time for 
the first packet is T{ = Assume that the second packet (2 is small, specifically, 
h < -(max; then the two packets are sent back-to-back and thus the departure time 
for the second packet is Tj =T{ + ^^. Now the spacing Tj — T{ is less than thus 
the second packet is not conformant, in other words, R' is not cr-smooth. Note that 
this example is not possible if all packets are the same size. 

We will see in this section that this example is quite general: packetizing variable 
length packets does introduce some additional irregularities. However, we are able 
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Figure 1.16: A real, variable length packet trunk of constant bit rate, viewed 
as the concatenation of a greedy shaper and a packetizer. The input is R{t), 
the output of the greedy shaper is R*{t), the final output is the output of the 
packetizer is R'{t). 



to quantify them, and we will see that the irregularities are small (but may be larger 
than the order of a packet length). Most results are extracted from [45] 

1.7.2 The Packetizer 

We first need a few definitions. 

Definition 1.7.1 (cumulative packet lengths). A sequence L of cumulative packet 
lengths is a wide sense increasing sequence (T(0) = 0, T(l), L{2 ), ...) such that 

^max = sup{L(n + 1) - L{n)} 



is finite 

In this chapter, we interpret L(n) — L(n — 1) as the length of the nth packet. We 
now introduce a new building block, which was introduced in [10]. 

Definition 1.7.2 (Function [10]). Consider a sequence of cumulative packet 
lengths L with L(0) = 0. For any real number x, define 

P^{x) = sup{L(n)l|i(„)<,,}} (1.19) 

nGN 



Figure 1.17 illustrates the definition. Intuitively, P^(x) is the largest cumulative 
packet length that is entirely contained in x. Function P^ is right-continuous; if R is 
right-continuous, then so is P^{R{t)). For example, if all packets have unit length, 
then L(n) = n and for a; > 0: P^{x) = \x\ . An equivalent characterization of P^ 



IS 



P^{x) = L{n) <;=> L{n) <x< L{n + 1) 



(1.20) 
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Figure 1.17: Definition of function P^. 



Definition 1.7.3 (Packetizer [10]). Consider a sequence L of cumulative packet 
lengths. An L-packetizer is the system that transforms the input R{t) into P^{R{t)). 

For the example in Figure 1.16, we have R'{t) = P^{R*{t)) and the system 
can thus he interpreted as the concatenation of a greedy shaper and a packetizer. 

The following equation follows immediately: 

X - fmax < P^(x) < X (1-21) 

Definition 1.7.4. VFe say that a flow R{f) is L-packetized if P^{R{t)) = R{f) for 
all t. 

The following properties are easily proven and left to the reader. 

• (The packetizer is isotone) If x <y then P^{x) < P^{y) for all x, j/ € IR. 

• (P^ is idempotent) P^(P^(x)) = P^{x) for all x G M 

• (Optimality of Packetizer) We can characterize a packetizer in a similar way 
as we did for a greedy shaper in Section 1.5. Among all flows x{t) such that 

J X is L-packetized 

\ X < i? 

there is one that upper-bounds all, and it is P^{R{t)). 

The proof for this last item mimics that of Lemma 1 .5. 1 ; it relies on the prop- 
erty that P^ is idempotent. 

We now study the effect of packetizers on the three bounds found in Section 1.4. 
We first introduce a definition. 

Definition 1.7.5 (Per-packet delay). Consider a system with L- packetized input 
and output. Call Ti , the arrival and departure time for the ith packet. Assume 
there is no packet loss. The per-packet delay is supfT^ — Tf) 
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Our main result in this section is the following theorem, illustrated in Fig- 
ure 1.18. 

Theorem 1.7.1 (Impact of packetizer). Consider a system (bit-by-hit system) with 
L-packetized input R and bit-by-bit output R* , which is then L-packetized to pro- 
duce a final packetized output R'. We call combined system the system that maps R 
into R'. Assume both systems are first-in-first-out and lossless. 

1. The per-packet delay /or the combined system is the maximum virtual delay 
for the bit-by-bit system. 

2. Call B* the maximum backlog for the bit-by-bit system and B ' the maximum 
backlog for the combined system. We have 

B* <B' <B* + (max 

3. Assume that the bit-by-bit system offers to the flow a maximum service curve 
7 and a minimum service curve (3. The combined system offers to the flow a 
maximum service curve 7 and a minimum service curve given by 

ff{t) = [(3{t) - (max]“^ 

4. If some flow S{t) has a{t) as an arrival curve, then P^{S{t)) has a{t) -\- 
(maxl{i>o} tin arrival curve. 

The proof of the theorem is given later in this section. Before, we discuss the 
implications. Item 1 says that appending a packetizer to a node does not increase 



Combined System 




Figure 1.18: The scenario and notation in Theorem 1 .7.1 . 

the packet delay at this node. However, as we see later, packetization does increase 
the end-to-end delay. 

Consider again the example in Section 1.7.1. A simple look at the hgure shows 
that the backlog (or required buffer) is increased by the packetization, as indicated 
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by item 2. Item 4 tells us that the final output R' has a'{t) = a (t) + /maxli>o as an 
arrival curve, which is consistent with our observation in Section 1.7.1 that R ' is not 
(T-smooth, even though R* is. We will see in Section 1.7.4 that there is a stronger 
result, in relation with the concept of “packetized greedy shaper”. 

Item 3 is the most important practical result in this section. It shows that pack- 
etizing weakens the service curve guarantee by one maximum packet length. For 
example, if a system offers a rate-latency service curve with rate R, then appending 
a packetizer to the system has the effect of increasing the latency by . 

Consider also the example in Figure 1.16. The combination of the trunk and the 
packetizer can be modeled as a system offering 

• a minimum service curve 

• a maximum service curve Ac 

Proof of Theorem 1.7.1 

1. For some t such that Ti <t < T,_|_i we have R{t) = L(i) and thus 

sup d{t) = d{Ti) 

iG[Ti.Ti + i) 

now 

d{Ti) =Tl-Ti 
Combining the two shows that 

sup d(t) = sup(T/ — Ti) 

t i 

2. The proof is a direct consequence of Equation (1.21). 

3. The result on maximum service curve 7 follows immediately from Equa- 
tion (1.21). Consider now the minimum service curve property. Fix some time 
t. For Ti < s < T,_|_i we have R{s) = R{Ti) and /) is wide-sense increasing, 
thus 

inf (R(s) + Pit - s)) = R(Ti) + pr(t - Ti) 
where P^it ~ Tp = infc>o{/3[(f — Tp -|- e]} is the limit of P to the right. Thus 
[R ®P)[t) = , inf {RiTP +p,it- TP) 

I such that li<t 

For a fixed t there is only a finite number of i such that Ti<t, thus the inf in 
the previous equation is a min and there is some j such that 

(R^p)(t)=R(Tj)+pr(t-Tj) 

By hypothesis, R*(t) > (i? ® P)(t), thus 
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R' (t) > R*{t) — Imax > R{Tj) + f^rit ~ Tj) — Imax 
On the other hand, R*(t) > R(Tj) and R is L-packetized, thus 

R'{t) > R{Tj) 



Combining the two shows that 

R'(t) > max [R(Tj),R(Tj) + (3r{t - Tj) - /^ax] 
= R{Tj) + max - Tj) - l^ax, 0] 

= R{Tj)+l3Ut-Tj) 



from which we conclude that R'{t) > info<s<i {R{s) + P' {t - s)) 

4. The proof is a direct consequence of Equation (1.21). 

Example: concatenation of GPS nodes Consider the concatenation of the the- 
oretical GPS node, with guaranteed rate R (see Section 1.3.1 on Page 22) and an 
L-packetizer. Assume this system receives a flow of variable length packets. This 
models a theoretical node that would work as a GPS node but is constrained to de- 
liver entire packets. This is not very realistic, and we will see in Chapter 2 more 
realistic implementations of GPS, but this example is sufficient to explain one im- 
portant effect of packetizers. 

By applying Theorem 1.7.1, we find that this node offers a rate-latency ser- 
vice curve Wx . Now concatenate m such identical nodes, as illustrated in Fig- 



Rate R, latency (m - 1) 




Figure 1.19: The concatenation of several GPS fluid nodes with packetized 
outputs 

ure 1.19. The end-to-end service curve is the rate latency-function (3 ji t with 



We see on this example that the additional latency introduced by one packetizer is 
indeed of the order of one packet length; however, this effect is multiplied by the 
number of hops. 

For the computation of the end-to-end delay bound, we need to take into account 
Theorem 1.7.1, which tells us that we can forget the last packetizer. Thus, a bound 
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on end-to-end delay is obtained by considering that the end-to-end path offers a 
service curve equal to the latency-function Pr^Tq with 

To = (to - 1) — 

For example, if the original input flow is constrained hy one leaky bucket of rate r 
and bucket pool of size b, then an end-to-end delay bound is 

b+{m- l)lmax 23) 

R 

The alert reader will easily show that this bound is a worst case bound. This il- 
lustrates that we should be careful in interpreting Theorem 1.7.1. It is only at the 
last hop that the packetizer implies no delay increase. The interpretation is as fol- 
lows. Packetization delays the first bits in a packet, which delays the processing at 
downstream nodes. This effect is captured in Equation (1.23). In summary: 

Remark 1.7.1. Packetizers do not increase the maximum delay at the node where 
they are appended. However, they generally increase the end-to-end delay. 

We will see in Chapter 2 that many practical schedulers can be modeled as the 
concatenation of a node offering a service curve guarantee and a packetizer, and we 
will give a practical generalization of Equation (1.23). 



1.7.3 A Relation between Greedy Shaper and Packetizer 

We have seen previously that appending a packetizer to a greedy shaper weakens 
the arrival curve property of the output. There is however a case where this is not 
true. This case is important for the results in Section 1.7.4, but also has practical 
applications of its own. Figure 1.20 illustrates the theorem. 




Ro(t) R(t) R*(t) R(i) (t) 



Figure 1.20: Theorem 1 .7.2 says that is c-smooth. 



Theorem 1.7.2. Consider a sequence L of cumulative packet lengths and call V l 
the L-packetizer. Consider a “ good" function a and assume that 

( There is a sub-additive function (Tq and a number I > (max such that 
\ cr(f) = ao(t) -b (lt>o 



(1.24) 
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Call Cfj the greedy shaper with shaping curve a. For any input, the output of the 
composition^V L ° ° Vl is a -smooth. 

In practical terms, the theorem is used as follows. Consider an L-packetized 
flow, pass it through a greedy shaper with shaping curve cr; and packetize the output; 
then the result is cr-smooth (assuming that a satisfies condition in Equation (1.24) 
in the theorem). 

Note that in general the output of ° is not L-packetized, even if a satisfies 
the condition in the theorem (finding a counter-example is simple and is left to the 
reader for her enjoyment). Similarly, if the input to V l ° Cfj is not L-packetized, 
then the output is not cr-smooth, in general. 

The theorem could also be rephrased by saying that, under condition in Equa- 
tion (1.24) 

Vl °Ca oVl = Ca oVl °Ca o Vl 
since the two above operators always produce the same output. 

Discussion of Condition in Equation (1.24) Condition Equation (1.24) is satis- 
fied in practice if a is concave and <7^(0) > (max, where <7^(0) = infj>o <^{t) is 
the limit to the right of a at 0. This occurs for example if the shaping curve is de- 
fined by the conjunction of leaky buckets, all with bucket size at least as large as the 
maximum packet size. 

This also sheds some light on the example in Figure 1.16: the problem occurs 
because the shaping curve Ac does not satisfy the condition. 

The alert reader will ask herself whether a sufficient condition for Equation ( 1 .24) 
to hold is that a is sub-additive and <7^(0) > (max- Unfortunately, the answer is no. 
Consider for example the stair function a = (max^'T- We have <7^(0) = (max but if 
we try to rewrite (7 into (7(f) = we must have ( = (max and (7o(f) = 0 

for t e (0, T]; if we impose that ao is sub-additive, the latter Implies (7 q = 0 which 
is not compatible with Equation (1.24).^ 

Proof of Theorem 1.7.2: We use the notation in Figure 1.20. We want to show 

that is (7-smooth. We have R* = i? ® (7. Consider now some arbitrary s and t 
with s < t. From the definition of min-plus convolution, for all e > 0, there is some 
u < s such that 

{R ® <7)(s) > R{u) a(s — u) — e (1-25) 

Now consider the set L of e > 0 such that we can find one u < s satisfying the 
above equation. Two cases are possible: either 0 is an accumulation point for E ^ 
(case 1) , or not (case 2). 

Consider case 1; there is a sequence (e„, Sn), with s„ < s, 

®We use the notation o C„ to denote the composition of the two operators, with C- applied first; 
see Section 4.1.3. 

^The same conclusion unfortunately also holds if we replace sub-additive by “star-shaped” (Sec- 
tion 3.1). 

^namely, there is a sequence of elements in E which converges to 0 
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lim e„ = 0 

n— >-+oo 



and 



{R ® cr)(s) > R{sn) + cr(s - s„) - e„ 

Now since s„ < t: 

{R ® (r)(i) < i?(s„) + (r(i - s„) 

Combining the two: 

{R ® a){t) - (i? ® cr)(s) < (r{t - s„) — a(s - s„) + e„ 



Now t — s„ > 0 and s — s„ > 0 thus 

0-(i - Sn) - cr(s - Sn) = (^o(t ~ Sn) “ (^o(s ~ S„) 
We have assumed that (Tq is sub-additive. Now t > s thus 

0-o(i - Sn) - Cro(s - Sn) < (^o(t ~ s) 



we have thus shown that, for all n 



{R ® (r){t) - (i? ® (t)(s) < doit - s) + e„ 

and thus 

(R ® a)(t) - (i? ® cr)(s) < ao(t - s) 

Now from Equation (1.21), it follows that 

R^^Ht) - R^^Hs) <ao(t- s) + /max < cr(f - s) 
which ends the proof for case 1 . 

Now consider case 2. There is some eo such that for 0 < e < eo, we have to take 
u = s in Equation (1.25). This implies that 

{R ® cr)(s) = R{s) 

Now R is L-packetized by hypothesis. Thus 

i?(l)(s) = P^{{R ® cr)(s)) = P^{R{s)) = R{s) = (i? ® cr)(s) 

thus 

= P^{{R^a){t) - (i?®cr)(s) 

< (i? ® (t) (f ) - (i? ® (t) (s) 

now R^ a has a as an arrival curve thus 

R^^'> (t) - R^^'> (s) < a(t-s) 



which ends the proof for case 2. 



□ 
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Example: Buffered Leaky Bucket Controller based on Virtual Finish Times 

Theorem 1.7.2 gives us a practical implementation for a packet based shaper. Con- 
sider that we want to build a device that ensures that a packet flow satisfies some 
concave, piecewise linear arrival curve (and is of course L- packetized). We can 
realize such a device as the concatenation of a buffered leaky bucket controller op- 
erating bit-by-bit and a packetizer. We compute the output time for the last bit of a 
packet (= finish time) under the bit-by-bit leaky bucket controller, and release the 
entire packet instantly at this finish time. If each bucket pool is at least as large as 
the maximum packet size then Theorem 1.7.2 tells us that the final output satisfies 
fhe leaky bucket constraints. 

Counter-example If we consider non-concave arrival curves, then we can find an 
arrival curve a that does satisfy a{t) > (max for t > 0 but that does not satisfy 
Equation (1.24). In such a case, the conclusion of Theorem 1.7.2 may not hold in 
general. Figure 1.21 shows an example where the output R is not cr-smooth, when 
(T is a stair function. 




Figure 1.21: A counter example for Theorem 1.7.2. A burst of 10 packets of 
size equal to 10 data units arrive at time t = 0, and a = 25vi. The greedy 
shaper emits 25 data units at times 0 and 1, which forces the packetizer to 
create a burst of 3 packets at time 1 , and thus is not cr-smooth. 



1.7.4 Packetized Greedy Shaper 

We can come back to the questions raised by the example in Figure 1.16 and give a 
more fundamental look at the issue of packetized shaping. Instead of synthesizing 
the concatenation of a greedy shaper and a packetizer as we did earlier, we define 
fhe following, consistent with Section 1.5. 
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Definition 1.7.6. [Packetized Greedy Shaper] Consider an input sequence of pack- 
ets, represented by the function R{t) as in Equation (1.18). Call L the cumulative 
packet lengths. We call packetized shaper, with shaping curve a, a system that forces 
its output to have a as an arrival curve and be L-packetized. We call packetized 
greedy shaper a packetized shaper that delays the input packets in a buffer, when- 
ever sending a packet would violate the constraint a, but outputs them as soon as 
possible. 

Example: Buffered Leaky Bucket Controller based on Bucket Replenishment 

The case a{t) = rniiim^i ... (t) can be implemented by a controller that 

observes a set of M fluid buckets, where the mth bucket is of size bm and leaks at a 
constant rate Vm - Every bucket receives 1, units of fluid when packet i is released (li 
is the size of packet i). A packet is released as soon as the level of fluid in bucket m 
allows it, that is, has gone down below bm — k, for all m. We say that now we have 
defined a buffered leaky bucket controller based on “bucket replenishment”. It is 
clear that the output has a as an arrival curve, is L-packetized and sends the packets 
as early as possible. Thus it implements the packetized greedy shaper. Note that this 
implementation differs from the buffered leaky bucket controller based on virtual 
finish times introduced in Section 1.7.3. In the latter, during a period where, say, 
bucket m only is full, fragments of a packet are virtually released at rate r m > bucket 
m remains full, and the (virtual) fragments are then re-assembled in the packetizer; 
in the former, if a bucket becomes full, the controller waits until it empties by at 
least the size of the current packet. Thus we expect that the level of fluid in both 
systems is not the same, the former being an upper bound. We will see however in 
Corollary 1.7.1 that both Implementations are equivalent. 

In this example, if a bucket size is less than the maximum packet size, then it is 
never possible to output a packet: all packets remain stuck in the packet buffer, and 
the output is R{t) = 0. In general, we can say that 

Proposition 1.7.1. Ifor (0) < (max then the the packetized greedy shaper blocks all 
packets for ever (namely, R(t) = 0). Thus in this section, we assume that a(f) > 
Ima^fort > 0 . 

Thus, for practical cases, we have to assume that the arrival curve a has a dis- 
continuity at the origin at least as large as one maximum packet size. 

How does the packetized greedy shaper compare with the concatenation of a 
greedy shaper with shaping curve a and a packetizer ? We know from the example 
in Figure 1.16 that the output has aft) = (r(f)-|-(maxli>o as an arrival curve, but not 
a. Now, does the concatenation implement a packetized greedy shaper with shaping 
curve a' ? Before giving a general answer, we study a fairly general consequence of 
Theorem 1.7.2. 

Theorem 1.7.3 (Realization of packetized Greedy Shaper). Consider a sequence 
L of cumulative packet lengths and a " good” function a. Assume that a satisfies the 
condition in Equation (1.24). Consider only inputs that are L packetized. Then the 
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packetized greedy shaper for a and L can be realized as the concatenation of the 
greedy shaper with shaping curve a and the L-packetizer. 




Figure 1.22: The packetized greedy shaper can be realized as a (bit-by-bit 
fluid shaper followed by a packetizer, assuming Equation (1 .24) holds. In 
practice, this means that we can realize packetized greedy shaping by com- 
puting finish times in the virtual fluid system and release packets at their 
finish times. 



Proof: Call R{t) the packetized input; the output of the bit-by-bit greedy shaper 

followed by a packetizer is (t) = P^{R (8> o')(f)). Call R{f) the output of the 
packetized greedy shaper. We have R < R thus R® a < R® cr and thus 

P^(R®a) < P^{R®a) 

But R is (T-smooth, thus R ^ a = R, and is L-packetized, thus P^{R (8) cr) = R. 
Thus the former inequality can be rewritten as i? < Conversely, from Theo- 

rem 1.7.2, is also cr-smooth and L-packetized. The definition of the packetized 
greedy shaper implies that R > R^^^ (for a formal proof, see Lemma 1.7.1) thus 
finally L = □ 

We have seen that the condition in the theorem is satisfied in particular if a 
is concave and (7^(0) > (max, for example if the shaping curve is defined by the 
conjunction of leaky buckets, all with bucket size at least as large as the maximum 
packet size. This shows the following. 

Corollary 1.7.1. For L-packetized inputs, the implementations of buffered leaky 
bucket controllers based on bucket replenishment and virtual finish times are equiv- 
alent. 

If we relax Equation (1.24) then the construction of the packetized greedy shaper 
is more complex: 
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Theorem 1.7.4 (I/O characterisation of packetized greedy shapers). Consider 
a packetized greedy shaper with shaping curve a and cumulative packet length L. 
Assume that a is a “ good” function. The output R{t) of the packetized greedy shaper 
is given by 

;R = inf I (^ 26 ) 

wiY/z /?(!)(/) = P^((cr (g) /?)(/) ) andR'^^Ht) = {{a ® R‘^^-'^'>){f)) for i > 2. 

Figure 1 .23 illustrates the theorem, and shows the iterative construction of the 
output on one example. Note that this example is for a shaping function that does 
not satisfy Equation (1.24). Indeed, otherwise, we know from Theorem 1.7.3 that 
the iteration stops at the first step, namely, R = R^^^ in that case. We can also check 
for example that if cr = (thus the condition in Proposition 1.7.1 is satisfied) then 
the result of Equation ( 1 .26) is 0 . 



2 3 4 5 



R(l) 







R 








pL ■<— 0 - 4 



R(2) 



R(3) 



R = R(4) 



Eigure 1.23: Representation of the output of the packetized greedy shaper 
(left) and example of output (right). The data are the same as with Fig- 
ure 1.21. 



Proof: The proof is a direct application of Lemma 1.7.1 (which itself is an appli- 
cation of the general method in Section 4.3 on Page 175). □ 

Lemma 1.7.1. Consider a sequence L of cumulative packet lengths and a “good” 
function a. Among all flows x{t) such that 



X < R 

X is L-packetized 
X has a as an arrival curve 



(1.27) 
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there is one flow R{t) that upper-bounds all. It is given by Equation ( 1.26). 

Proof: The lemma is a direct application of Theorem 4.3.1, as explained in Sec- 

tion 4.3.2. However, in order to make this chapter self-contained, we give an alter- 
native, direct proof, which is quite short. 

If a; is a solution, then it is straightforward to show by induction on i that x{t) < 
(t) and thus x < R. The difficult part is now to show that R is indeed a solution. 
We need to show that the three conditions in Equation (1.27) hold. Firstly, R < 
R(t) and by induction on i, < R for all i; thus R < R. 

Secondly, consider some hxed f; is L-packetized for all i > 1. Let 

L(no) := (f). Since (f) < (f), (f) is in the set 



{L(0),L(l),L(2),...,L(no)}. 



This set is finite, thus, R{t), which is the infimum of elements in this set, has to be 
one of the L{k) for k < hq. This shows that R{t) is L-packetized, and this is true 
for any time t. 

Thirdly, we have, for all i 

R(t) < = P^iia ® < (cr ® 



thus 

L < inf (cr ® i?W) 

i 

Now convolution by a hxed function is upper-semi-continuous, which means 

that 



inf(cr® i?W) = cr® J? 

i 

This is a general result in Chapter 4 for any min-plus operator. An elementary proof 
is as follows. 



infj(cr® i?W)(f) = 



inf^G[o.i].*GN [<7(s) + - s)] 

inf^G[o.i] {inf*GN [(^(s) + R^'Kt - s)] } 
inf^Gfo.i] {cr(s) + mfjgN “ «)] } 

inf5G[o,i] [cr(s) + R{t - s)] 

(<T ® R){t) 



Thus 



i? < (T ® J?, 



which shows the third condition. Note that R is wide-sense increasing. 



□ 



Does a packetized greedy shaper keep arrival constraints ? Figure 1 .24 shows 
a counter-example, namely, a variable length packet flow that has lost its initial 
arrival curve constraint after traversing a packetized greedy shaper. 

However, if arrival curves are dehned by leaky buckets, we have a positive result. 
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R 

R 




Figure 1.24: The input flow is shown above; it consists of 3 packets of size 
10 data units and one of size 5 data units, spaced by one time unit. It is 
a-smooth with a = 10ui,o- The bottom flow is the output of the packetized 
greedy shaper with a = 25u3,o- The output has a burst of 15 data units 
packets at time 3. It is cr-smooth but not a-smooth. 

Theorem 1.7.5 (Conservation of concave arrival constraints). Assume an L- 
packetized flow with arrival curve a is input to a packetized greedy shaper with 
cumulative packet length L and shaping curve a. Assume that a and a are concave 
with 0:^(0) > (max o.nd (7^(0) > (max- Then the output flow is still constrained by 
the original arrival curve a. 

Proof: Since a satisfies Equation (1.24), it follows from Theorem 1.7.3 that R = 

P^{cF ® R). Now R is a-smooth thus it is not modihed by a bit-by-bit greedy 
shaper with shaping curve a, thus R = a ® R. Combining the two and using the 
associativity of (8) gives R = P^[{a (8 a) 8 i?]. From our hypothesis, cr 8 a = 
min(cr, a) (see Theorem 3.1.6 on Page 136) and thus (t 8Q! satisfies Equation (1.24). 
Thus, by Theorem 1.7.2, i? is cr 8 a-smooth, and thus a-smooth. □ 

Series decomposition of shapers 

Theorem 1.7.6. Consider a tandem of M packetized greedy shapers in series; as- 
sume that the shaping curve (T™ of the mth shaper is concave with (T™(0) > (max- 
For L-packetized inputs, the tandem is equivalent to the packetized greedy shaper 
with shaping curve a = min^ o'™. 

Proof: We do the proof for M = 2 as it extends without difficulty to larger values 

of M. Call R{t) the packetized input, R'{t) the output of the tandem of shapers, and 
R{t) the output of the packetized greedy shaper with input R{t). 

Firstly, by Theorem 1.7.3 

R' =pi[cT2 8P^(f7^ 8i?)] 

Now (T™ > a for all m thus 



R' > P^[(J8P^(cr8i?)] 
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Again by Theorem 1.7.3, we have R = P^{a ^ R). Moreover R is L-packetized 
and (T-smooth, thus R = P^{R) and R = a ® R. Thus finally 

(1.28) 

Secondly, R' is L-packetized and by Theorem 1.7.5, it is cr-smooth. Thus the 
tandem is a packetized (possibly non greedy) shaper. Since R(t) is the output of the 
packetized greedy shaper, we must have R' < R. Combining with Equation (1.28) 
ends the proof. □ 

It follows that a shaper with shaping curve a{t) = min^^i ... + bm), 

where bm > (max for all to, can be implemented by a tandem of M individual 
leaky buckets, in any order. Furthermore, by Corollary 1.7.1, every individual leaky 
bucket may independently be based either on virtual finish times or on bucket re- 
plenishment. 

If the condition in the theorem is not satisfied, then the conclusion may not hold. 
Indeed, for the example in Figure 1.24, the tandem of packetized greedy shapers 
with curves a and cr does not have an a-smooth output, therefore it cannot be equiv- 
alent to the packetized greedy shaper with curve min(ci!, cr). 

Unfortunately, the other shaper properties seen in Section 1.5 do not generally 
hold. For shaping curves that satisfy Equation (1.24), and when a packetized greedy 
shaper is introduced, we need to compute the end-to-end service curve by applying 
Theorem 1.7.1. 



1.8 Lossless Effective Bandwidth and Equivalent Ca- 
pacity 

1.8.1 Effective Bandwidth of a Flow 

We can apply the results in this chapter to define a function of a flow called the 
effective bandwidth. This function characterizes the bit rate required for a given 
flow. More precisely, consider a flow with cumulative function R; for a fixed, but 
arbitrary delay D, we define the effective bandwidth e d{R) of the flow as the bit 
rate required to serve the flow in a work conserving manner, with a virtual delay 
< D. 



Proposition 1.8.1. The effective bandwidth of a flow is given by 

, R(t) — R(s) 

eoiR) = sup y 

0<s<t t — S JJ 



(1.29) 



For an arrival curve a we define the effective bandwidth e jj (a) as the effective 
bandwidth of the greedy flow R = a. By a simple manipulation of Equation 1.29, 
the following comes. 

Proposition 1.8.2. The effective bandwidth of a “good” arrival curve is given by 
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eoia) = sup (1.30) 

o<s s + D 

The alert reader will check that the effective bandwidth of a flow R is also the 
effective bandwidth of its minimum arrival curve R <Z) R. For example, for a flow 
with T-SPEC (p,M,r,b), the effective bandwidth is the maximum of r and the 
slopes of lines (QAq) and (QAi) in Figure 1.25; it is thus equal to: 

ec=max|^,r,p||l-|^^^j| (1.31) 

Assume a is sub-additive. We define the sustainable rate m as m = lim inf 5 _>_|_oo 




Q 
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Figure 1.25: Computation of Effective Bandwidth for a VBR flow (left); exam- 
ple for r = 20 packets/second, M = 10 packets, p = 200 packets per second 
and 6 = 26 packets (right). 



and the peak rate by p = sup^^g Then m < eoia) < p for all D. Moreover, 
if a is concave, then lim/ 3 _j._|_(X) eo io) = m.If a is differentiable, e{D) is the slope 
of the tangent to the arrival curve, drawn from the time axis at f = —D (Figure 
1.26). It follows also directly from the definition in (1.29) that 

eoi^ai) <'^eD{ai) (1.32) 

i i 

In other words, the effective bandwidth for an aggregate flow is less than or equal 
to the sum of effective bandwidths. If the flows have all identical arrival curves, 
then the aggregate effective bandwidth is simply / x e (ai ) . It is this latter relation 
that is the origin of the term “effective bandwidth”. The difference eoicti) — 
Oil) is a buffering gain; it tells us how much capacity is saved by sharing a 
buffer between the flows. 

1.8.2 Equivalent Capacity 

Similar results hold if we replace delay constraints by the requirement that a fixed 
buffer size is not exceeded. Indeed, the queue with constant rate C, guarantees a 
maximum backlog of B (in bits) for a flow i? if C > /s(i?), with 
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Figure 1.26: Effective Bandwidth for a delay constraint D and Equivalent Ca- 
pacity for a buffer size B 



fsiR) 



R(t) - R(s) - B 
sup 

0<s<t t ^ 



(1.33) 



Similarly, for a “good” function a, we have: 

/b(q:) = sup — — (1.34) 

s>0 s 

We call Jb (a) the equivalent capacity, by analogy to [42]. Similar to effective band- 
width, the equivalent capacity of a heterogeneous mix of flows is less than or equal 
to the sum of equivalent capacities of the flows, provided that the buffers are also 
added up; in other words, /b(q:) < ^ . fsi (o:*), with a = ^ . a* and B = Y,. Bi. 
Figure 1.26 gives a graphical interpretation. 

For example, for a flow with T-SPEC {p, M, r, b), using the same method as 
above, we find the following equivalent capacity: 



Ib = 



if B < M then -|- oo 

else r + 
else r -|- 



(1.35) 



An immediate computation shows that fb{'Jr,b) = t. In other words, if we allo- 
cate to a flow, constrained by an affine function 'Yr,b, a capacity equal to its sustain- 
able rate r, then a buffer equal to its burst tolerance b is sufficient to ensure loss-free 
operation. 

Consider now a mixture of Intserv flows (or VBR connections), with T-SPECs 
(Mi, Pi, Vi, bi). If we allocate to this aggregate of flows the sum of their sustainable 
rates Yi fi, then the buffer requirement is the sum of the burst tolerances Yi 
regardless of other parameters such as peak rate. Conversely, Equation 1.35 also 
illustrates that there is no point allocating more buffer than the burst tolerance: if 
B > b, then the equivalent capacity is still r. 

The above has illustrated that it is possible to reduce the required buffer or delay 
by allocating a rate larger than the sustainable rate. In Section 2.2, we described how 
this may be done with a protocol such as RSVP 
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Note that formulas ( 1 .29) or (1 .33), or both, can be used to estimate the capacity 
required for a flow, based on a measured arrival curve. We can view them as low- 
pass filters on the flow function R. 



1.8.3 Example: Acceptance Region for a FIFO Multiplexer 

Consider a node multiplexing n i flows of type 1 and ri 2 flows of type 2, where every 
flow is defined by a T-SPEC (p,, M,, r,, 6,). The node has a constanf outpuf rafe C. 
We wonder how many flows fhe node can accept. 

If the only condition for flow acceptance is that the delay for all flows is bounded 
by some value D, then the set of acceptable values of (n i , ri 2 ) is defined by 

en(niQ!i -I- n2Q!2) < C 

We can use the same convexity arguments as for the derivation of formula (1.31), 
applied to the function riiai + n 2 a 2 - Define and assume 6 i < 62 - The 

result is: 



eoiniai + 7120:2) = max < 



m A/ 1~|- 772^/2 
D ’ 

mMi + n2M2 + (nipi +n2P2)^i 
Oi+D 

ni6i + n2M2 + (nin + n2P2 )^2 
92-\-D ’ 

riiri -I- ri2r2 



The set of feasible (n 1 , 112 ) derives directly from the previous equation; it is the 
convex part shown in Figure 1.27. The alert reader will enjoy performing the com- 
putation of the equivalent capacity for the case where the acceptance condition bears 
on a buffer size B. 



i 


Pi 


Mi 


Ti 


hi 


Oi 




20 ’ 000 packets/s 


1 packet 


500 packets/s 


26 packets 


1.3 ms 


2 


5 ’000 packets/s 


1 packet 


500 packets/s 


25 1 packets 


55.5 ms 



Figure 1.27: Acceptance region for a mix of type 1 and type 2 flows. Maximum 
delay D = xx. The parameters for types 1 and 2 are shown in the table, 
together with the resulting values of 6*,. 



Coming back to equation 1.32, we can state in more general terms that the ef- 
fective bandwidth is a convex function of function a, namely: 

eniaai + (1 - 0)0:2) < aen(oi) + (1 - 0)6/3(02) 

for all a € [0,1]. The same is true for the equivalent capacity function. 

Consider now a call acceptance criterion based solely on a delay bound, or based 
on a maximum buffer constraint, or both. Consider further that there are I types of 
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connections, and define the acceptance region A as the set of values (n i, . . . , rij) 
that satisfy the call acceptance criterion, where n, is the number of connections 
of class i. From the convexity of the effective bandwidth and equivalent capacity 
functions, it follows that the acceptance region A is convex. In chapter 9 we compare 
this to acceptance regions for systems with some positive loss probability. 



Sustainable Rate Allocation If we are interested only in course results, then we 
can reconsider the previous solution and take into account only the sustainable rate 
of the connection mix. The aggregate flow is constrained (among others) by a{s) = 
b + rs, with b = ^ = J2i ■ Theorem 1 .4. 1 shows that the maximum 

aggregate buffer occupancy is bounded by b as long as C > r. In other words, 
allocating the sustainable rate guarantees a loss-free operation, as long as the total 
buffer is equal to the burstiness. 

In a more general setting, assume an aggregate flow has a as minimum arrival 
curve, and assume that some parameters r and b are such that 

lim a(s) — rs — b = 0 

5— >- + oo 

so that the sustainable rate r with burstiness 6 is a tight bound. It can easily be shown 
that if we allocate a rate C = r, then the maximum buffer occupancy is b. 

Consider now multiplexing a number of VBR connections. If no buffer is avail- 
able, then it is necessary for a loss-free operation to allocate the sum of the peak 
rates. In contrast, using a buffer of size b makes it possible to allocate only the sus- 
tainable rate. This is what we call the buffering gain, namely, the gain on the peak 
rate obtained by adding some buffer. The buffering gain comes at the expense of 
increased delay, as can easily be seen from Theorem 1 .4.2. 



1.9 Proof of Theorem 1.4.5 

Step 1: Consider a fixed time to and assume, in this step, that there is some time 

uo that achieves the supremum in the definition of a 0 /3. We construct some input 
and output functions R and R* such that R is constrained by a, the system (R, R*) 
is causal, and a* (to) = {R* 0 R*){to). R and R* are given by (Figure 1.28) 

R(t) = a(t) if t < Uo + to 
R(t) = a(uo + to) if t > Uo + to 
R*{t) = inf [a (f), /3(f)] ift<uo+to 
R* {t) = R{t) if t >uo +to 

It is easy to see, as in the proof of Theorem 1 .4.4 that R and R * are wide-sense 
increasing, that R* < R and that /3 is a service curve for the flow. Now 



R*{uo + to) - R*{uo) = a{uo +fo) - R*{uo) > a{uo +fo) - P{uo) = a* (to) 
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Figure 1.28: Step 1 of the proof of Theorem 1 .4.5: a system that attains the 
output bound at one value to. 

Step 2: Consider now a sequence of times to,ti, ...,tn, ... (not necessarily increas- 

ing). Assume, in this step, that for all n there is a value Un that achieves the supre- 
mum in the definition of {a 0 /3) (f„) . We prove that there are some functions R and 
R* such that R is constrained by a, the system (R, R*) is causal, has /3 as a service 
curve, and a*{tn) = {R* 0 R*)(tn) for all n > 0. 

We build R and i?* by induction on a set of increasing intervals [0, so]^0i 
[0 , s„] • • • ■ The induction property is that the system restricted to time interval [0 , s „] 
is causal, has a as an arrival curve for the input, has /3 as a service curve, and satisfies 
= {R* 0 R*){ti) for i <n. 

The first interval is defined by sq = uq + to', R and R* are built on [0, sq] as 
in step 1 above. Clearly, the induction property is true for n = 0. Assume we have 
built the system on interval [0, s„]. Define now s„+i = s„ -I- -I- We 

chose i5„+i such that 

a{s + (5„+i) — a{s) > R(sn) for all s > 0 (1.36) 

This is possible from the last condition in the Theorem. The system is defined on 
]s„,s„+i] by (Figure 1.29) 

R(t) = R*{t) = R(Sn) for Sn < t < Sn + <5„+l 
^ — R{s^^ -f Q.{t Syi ()| 2 _|-i) for Syi -f ^ f ^ 

R (f) — R(^Syi) -f (rr A /f)(f ^^n-t-l) fttr Sn T ^n-t-l ^ t ^ -^n-t-l 

, R* {sn+l) = R{Sn+l) 

We show now that the arrival curve constraint is satisfied for the system defined 
on [0,s„_|_i]. Consider R(t) — R{v) for t and v in [0,s„_|_i]. If both t < Sn and 

V < s„, or if both t > Sn and v > Sn then the arrival curve property holds from 
our construction and the induction property. We can thus assume that t > s n and 

V < Sn- Clearly, we can even assume that t > Sn + Sn+i, otherwise the property is 
trivially true. Let us rewrite t = Sn + 5n+i + s. We have, from our construction: 

R{t) — R(v) = R(Sn + Sn+l + s) — R(v) = R(Sn) + Ct(s) ~ R(v) < R(Sn) + Ct(s) 

Now from Equation (1.36), we have: 

R(sn) + a{s) < a(s + 5„+i) < a(s + Sn+i + Sn - v) = a(t - v) 
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Figure 1.29: Step 2 of the proof of Theorem 1 .4.5: a system that attains the 
output bound for all values n g N. 

which shows the arrival curve property. 

Using the same arguments as in step 1, it is simple to show that the system is 
causal, has /? as a service curve, and that 

^ (^n+l T fn+l) ^ (^n+l) — rr 

which ends the proof that the induction property is also true for n + 1. 

Step 3: Consider, as in step 2, a sequence of times ■■■ (not necessarily 

increasing). We now extend the result in step 2 to the case where the supremum in 
the dehnition of a* = {a <Z) P){tn) is not necessarily attained. Assume first that 
a* (tn) is finite for all n. For all n and all m G N* there is some Um,n such that 

T ‘^m,n} ^ rr {tn} (1-37) 

TO 

Now the set of all couples (to , n) is enumerable. Consider some numbering {M{i),N{i}), 
i G N for that set. Using the same construction as in step 2, we can build by in- 
duction on i a sequence of increasing intervals [0, s,] and a system {R, R*) that is 
causal, has a as an arrival curve for the input, has /3 as a service curve, and such that 

R*(si) - R*{si - tN(i)} > a*{tN(i)} - 

Now consider an arbitrary, but fixed n. By applying the previous equations to all i 
such that N{i) = n, we obtain 

{R 0 R }{tn} ^ SUPj that N (i) — n {tN(i)} M(i) ^ 

Ct {tn} iuf j that N{i) — n ^ M(i) 

Now the set of all for i such that N{i) = n is N* , thus 
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i such that N(i) — n \ 

and thus {R* 0 = a* (tn), which ends the proof of step 3 in the case where 

a* (tn) is finite for all n. 

A similar reasoning can be used if a*{tn) is infinite for some tn- In that case 
replace Equation (1.37) by a{tn + Um,n) ~ /3(um,n) > m. 

Step 4: Now we conclude the proof. If time is discrete, then step 3 proves the theo- 

rem. Otherwise we use a density argument. The set of nonnegative rational numbers 
Q+ is enumerable; we can thus apply step 3 to the sequence of all elements of Q+ , 
and obtain system {R, R*), with 

{R* 0 R*){q) = a*{q) for all g G Q+ 

Function R* is right-continuous, thus, from the discussion at the end of Theo- 
rem 1.2.2, it follows that R* 0 R* is left-continuous. We now show that a* is also 
left-continuous. For all f > 0 we have: 

supa:*(s) = sup {a:(s-|-r;)— /3(r;)} = sup{sup[a:(s-|-r;)— /3(r;)]} 

s<t such that s<t and s<t 

Now 

sup alyS -I- r;) = a{t + v) 

S<t 

because a is left-continuous. Thus 

supa:*(s) = sup{a:(f -hr;) — = a* {t) 

s<t jj>0 

which shows that a is left-continuous. 

Back to the main argument of step 4, consider some arbitrary f > 0. The set Q+ 
is dense in the set of nonnegative real numbers, thus there is a sequence of rational 
numbers g„ G Q"*" , with n G N, such that qn <t and lim„_j._|_oo Qn = t. From the 
left-continuity of R* 0 R* and a* we have: 

{R* (2) R*){t) = lim {R* (2) R*){q„) = lim a*{q„)=a*{t) 

n— >-+co n— >-+co 




□ 



1.10 Bibliographic Notes 

Network calculus as has been applied to dimensioning ATM switches in [53]. A 
practical algorithm for the determination of the minimum arrival curve for ATM 
system is described in [54]. It uses the burstiness function of a flow, defined in [51] 
as follows. For any r, B{r) is the minimum b such that the flow is 7r,;>-smooth, and 
is thus the required buffer if the flow is served at a constant rate r. Note that B{r) is 
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the Legendre transform of the minimum arrival curve a of the flow, namely, B{r) = 
supj>o(o'(f) — rt) [54] gives a fast algorithm for computing B (r) . Interestingly, the 
concept is applied also to the distribution of symbols in a text. 

In [70], the concepts of arrival and service curve are used to analyze real time 
processing systems. It is shown that the service curve for a variable capacity node 
must be super-additive, and conversely, any super- additive function is a service 
curve for a variable capacity node. Compare to greedy shapers, which have a sub- 
additive service curve. This shows that, except for constant bit rate trunks, a greedy 
shaper cannot be modeled as a variable capacity node, and conversely. 

In [8], the authors consider a crossbar switch, and call r , j the rate assigned to 
the traffic from input port i to output port j. Assume that r, j < 1 for all j 
and Y,. Tij < 1 for all i. Using properties of doubly-stochastic matrices (such as 
(r, j ) is), they give a simple scheduling algorithm that guarantees that the flow from 
port i to port j is allocated a variable capacity C satisfying Cij{t) — Cij{s) > 
rij{t — s) — Sij for some s, j defined by the algorithm. Thus, the node offers a 
service curve equal to the rate-latency function j ■ 

A dual approach to account for variable length packets is introduced in [10]. It 
consists in replacing the definition of arrival curve (or cr-smoothness) by the concept 
of gi-regularity. Consider a flow of variable length packets, with cumulative packet 
length L and call T, the arrival epoch for the ith packet. The flow is said to be g- 
regular if T{j) — T(i) > g{L{j) — L{i)) for all packet numbers i < j. A theory 
is then developed with concepts similar to the greedy shaper. The theory uses max- 
plus convolution instead of min-plus convolution. The {b, r) regulator originally 
introduced by Cruz [18] is a shaper in this theory, whose output is gi-regular, with 
g{x) = . This theory does not exactly correspond to the usual concept of 

leaky bucket controllers. More specifically, there is not an exact correspondence 
between the set of flows that are gi-regular on one hand, and that are cr-smooth on 
the other. We explain why with an example. Consider the set of flows that are g- 
regular, with g{x) = The minimum arrival curve we can put on this set of flows 
is a{t) = rt + (max [10]. But conversely, if a flow is cr-smooth, we cannot guarantee 
that it is gi-regular. Indeed, the following sequence of packets is a flow that is a- 
smooth but not gi-regular: the flow has a short packet (length 1 1 < (max) at time 
Ti = 0, followed by a packet of maximum size (max at time T 2 = In fact, if a 

flow is (T-smooth, then it is gi'-regular, with g'{x) = . 

The strict service curve in Definition 1.3.2 is called “strong” service curve in 
[41]. 

1.11 Exercises 

Exercise 1.1. Compute the maximum buffer size X for a system that is initially 
empty, and where the input function is R(t) = fg r(s)ds, for the following cases. 



1. ifr(t) = a (constant) 
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2. one on-off connection with peak rate 1 Mb/s, on period 1 sec, off period r 
seconds, and trunk bit rate c = 0.5 Mb/s. 

3. if r{t) = c + c sin Lot, with trunk bit rate c > 0. 

Exercise 1.2. You have a fixed buffer of size X, that receives a data input r{f). 
Determine the output rate c that is required to avoid buffer overflow given that the 
buffer is initially empty. 

Exercise 1.3. 1. For a flow with constant bit rate c, give some possible arrival 

curves. 

2. Consider a flow with an arrival curve given by: a{t) = B, where B is con- 
stant. What does this mean for the flow ? 

Exercise 1.4. We say that a flow is {P, B) constrained if it has 7 p,s as an 
arrival curve. 

A trunk system has a buffer size of B and a trunk bitrate of P. Fill in the 
dots: (1) there is no loss if the input is {., .) constrained (2) the output is {., .) 
constrained. 

2. A {P, B) constrained flow is fed into an infinite buffer served at a rate of c. 
What is the maximum delay ? 

Exercise 1.5 (On-Off flows). 1. Assume a data flow is periodical, with period 
T, and satisfies the following: r(t) = p for 0 < i < Tq, and r(t) = 0 for 
To<t<T. 

(a) Draw R(t) = ^^r{s)ds 

(b) Find an arrival curve for the flow. Find the minimum arrival curve for 
the flow. 

(cj Find the minimum (r, b) such that the flow is (r, h) constrained. 

2. A traffic flow uses a link with bitrate P (bits/s). Data is sent as packets of 
variable length. The flow is controlled by a leaky bucket (r,b). What is the 
maximum packet size ? What is the minimum time interval between packets of 
maximum size ? 

Application: P = 2 Mb/s, r = 0.2 Mb/s; what is the required burst tolerance b 
if the packet length is 2 Kbytes ? What is then the minimum spacing between 
packets ? 

Exercise 1.6. Consider the following alternative definition of the GCRA: 

Deflnition 1.11.1. The GCRA (T, r) is a controller that takes as input a cell arrival 
time t and returns result. It has internal (static) variables X (bucket level) and 
LCT (last conformance time). 



• initially, X = 0 and LCT = 0 
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• when a cell arrives at time t, then 

if (X - t + LCT > tau) 

result = NON- CONFORMANT; 
else { 

X = max (X - t + LCT, 0) + T; 

LCT = t; 

result = CONFORMANT; 

} 



Show that the two definitions ofGCRA are equivalent. 



Exercise 1.7. 7. For the following flows and a GCRA(10, 2), give the confor- 

mant and non- conformant cells. Times are in cell slots at the link rate. Draw 
the leaky bucket behaviour assuming instantaneous cell arrivals. 

(a) 0, 10, 18, 28, 38 

(b) 0, 10, 15, 25, 35 

(c) 0, 10, 18, 26, 36 

(d) 0, 10, 11, 18, 28 

2. What is the maximum number of cells that can flow back to back with 
GCRA(T, CDVT) (maximum “clump” size) ? 

Exercise 1.8. 7. For the following flows and a GCRA(100, 500), give the con- 

formant and non- conformant cells. Times are in cell slots at the link rate. 

(a) 0, 100, no, 12, 130, 140, 150, 160, 170, 180, 1000, 1010 

(b) 0, 100, 130, 160, 190, 220, 250, 280, 310, 1000, 1030 

(c) 0, 10, 20, 300, 310, 320, 600, 610, 620, 800, 810, 820, 1000, 1010, 1020, 
1200, 1210, 1220, 1400, 1410, 1420, 1600, 1610, 1620 

2. Assume that a cell flow has a minimum spacing of 7 time units between cell 
emission times (7 is the minimum time between the beginnings of two cell 
transmissions). What is the maximum burst size for GCRA(T, r) ? What is the 
minimum time between bursts of maximum size ? 

3. Assume that a cell flow has a minimum spacing between cells of^ time units, 
and a minimum spacing between bursts ofTi. What is the maximum burst size 

Exercise 1.9. For a CBR connection, here are some values from an ATM operator: 

peak cell rate (cells/s) 100 1000 10000 100000 

CDVT (microseconds) 2900 1200 400 135 
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1. What are the {P, B) parameters in b/s and bits for each case ? How does T 
compare to r ? 

2. If a connection requires a peak cell rate of 1000 cells per second and a cell 
delay variation of 1400 microseconds, what can be done ? 

3. Assume the operator allocates the peak rate to every connection at one buffer. 
What is the amount of buffer required to assure absence of loss ? Numerical 
Application for each of the following cases, where a number N of identical 
connections with peak cell rate P is multiplexed. 

case 12 3 4 

nb of connnections 3000 300 30 3 

peak cell rate (c/s) 100 1000 10000 100000 



Exercise 1.10. The two questions in this problem are independent. 

1. An ATM source is constrained by GCRA(T = 30 slots, r = 60 slots), where 
time is counted in slots. One slot is the time it takes to transmit one cell on the 
link. The source sends cells according to the following algorithm. 

• In a first phase, cells are sent at times t{l) = 0, t{2) = 15, i(3) = 
30, . . . , t{n) = 15(n — 1) as long as all cells are conformant. In other 
words, the number n is the largest integer such that all cells sent at times 
t{i) = 15(i — 1), i < n are conformant. The sending of cell n at time 
t{n) ends the first phase. 

• Then the source enters the second phase. The subsequent cell n + 1 is 
sent at the earliest time after t{n) at which a conformant cell can be 
sent, and the same is repeated for ever. In other words, call t{k) the 
sending time for cell k, with k > n; we have then: t{k) is the earliest 
time after t{k — 1) at which a conformant cell can be sent. 

How many cells were sent by the source in time interval [0, 151] ? 

2. A network node can be modeled as a single buffer with a constant output 
rate c (in cells per second). It receives I ATM connections labeled 

Each ATM connection has a peak cell rate pi ( in cells per second) and a cell 
delay variation tolerance Ti (in seconds) for 1 < * < 7. The total input rate 
into the buffer is at least as large as Pi (which is equivalent to saying 
that it is unlimited). What is the buffer size (in cells) required for a loss-free 
operation ? 

Exercise 1.11. In this problem, time is counted in slots. One slot is the duration to 
transmit one ATM cell on the link. 

1. An ATM source Si is constrained by GCRA(T = 50 slots, r = 500 slots). 
The source sends cells according to the following algorithm. 
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• In a first phase, cells are sent at times t{l) = 0, t{2) = 10, i(3) = 
20, . . . , t{n) = 10(n — 1) as long as all cells are conformant. In other 
words, the number n is the largest integer such that all cells sent at times 
t{i) = 10(i — 1), i < n are conformant. The sending of cell n at time 
t{n) ends the first phase. 

• Then the source enters the second phase. The subsequent cell n + 1 is 
sent at the earliest time after t{n) at which a conformant cell can be 
sent, and the same is repeated for ever. In other words, call t{k) the 
sending time for cell k, with k > n; we have then: t{k) is the earliest 
time after t{k — 1) at which a conformant cell can be sent. 

How many cells were sent by the source in time interval [0, 401] ? 

2. An ATM source S 2 is constrained by both GCRA(T = 10 slots, r = 2 slots) 
and GCRA(T = 50 slots, r = 500 slots). The source starts at time 0, and has 
an infinite supply of cells to send. The source sends its cells as soon as it is 
permitted by the combination of the GCRAs. We call t{n) the time at which 
the source sends the nth cell, with t{l) = 0. What is the value oft{15) ? 

Exercise 1.12. Consider a flow R{t) receiving a minimum service curve guarantee 

(3. Assume that 

• (3 is concave and wide-sense increasing 

• the inf in R® (3 is a min 

For all t, call r{t) a number such that 

(R ® l3)(t) = R(T(t)) + (3{t - T{f)) 

Show that it is possible to choose r such that if ti < ^2 then r{ti) < ritf). 

Exercise 1.13. 1. Find the maximum backlog and maximum delay for an ATM 

GBR connection with peak rate P and cell delay variation r, assuming the 
service curve is c(t) = r(t — Tq)"'' 

2. Find the maximum backlog and maximum delay for an ATM VBR connection 
with peak rate P, cell delay variation r, sustainable cell rate M and burst 
tolerance tb (in seconds), assuming the service curve is c(t) = r(t — Tq)"*" 

Exercise 1.14. Show the following statements: 

1. Consider a (P, B) constrained flow, served at a rate c > P. The output is 
also (P, B) constrained. 

2. Assume a{) has a bounded right-handside derivative. Then the output for a 
flow constrained by a{), served in a buffer at a constant rate c > sup a' (t), 
is also constrained by a{). 
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Exercise 1.15. 1. Find the the arrival curve constraining the output for an ATM 

CBR connection with peak rate P and cell delay variation r, assuming the 
service curve is c(t) = r(t — Tq)"'' 

2. Find the arrival curve constraining the output for an ATM VBR connection 
with peak rate P, cell delay variation r, sustainable cell rate M and burst 
tolerance tb (in seconds), assuming the service curve is c(t) = r(t — To)"*" 

Exercise 1.16. Consider the figure “Derivation of arrival curve for the output of a 
flow served in a node with rate-latency service curve ”■ What can be said if to 
in the Figure is infinite, namely, if a' (t) > r for all t ? 

Exercise 1.17. Consider a series of guaranteed service nodes with service curves 
eft) = rft — What is the maximum delay through this system for a flow 

constrained by (to, h) ? 

Exercise 1.18. A flow with T-SPEC (p, M, r, h) traverses nodes 1 and 2. Node i 
offers a service curve eff) = Rft — Ti)'^ . What buffer size is required for the flow 
at node 2 ? 

Exercise 1.19. A flow with T-SPEC (p, M, r, b) traverses nodes 1 and 2. Node i 
offers a service curve eff) = Ri(t — Tf)~^ . A shaper is placed between nodes 1 and 

2. The shaper forces the flow to the arrival curve z(t) = min(i? 2 i, bt + to). 

1. What buffer size is required for the flow at the shaper ? 

2. What buffer size is required at node 2 ? What value do you find ifT\ = T 2 ? 

3. Compare the sum of the preceding buffer sizes to the size that would be re- 
quired if no reshaping is performed. 

4. Give an arrival curve for the output of node 2. 

Exercise 1.20. Prove the formula giving of paragraph “Buffer Sizing at a Re- 
shaper” 

Exercise 1.21. Is Theorem “Input-Output Characterization of Greedy Shapers” a 
stronger result than Corollary “Service Curve offered by a Greedy Shaper” ? 

Exercise 1.22. 1. Explain what is meant by “we pay bursts only once”. 

2. Give a summary in at most 15 lines of the main properties of shapers 

3. Define the following concepts by using the ® operator: Service Curve, Arrival 
Curve, Shaper 

4. What is a greedy source ? 

Exercise 1.23. 1. Show that for a constant bit rate trunk with rate c, the backlog 

at time t is given by 

W(t) = sup {R{t) — R*{s) — c(t — s)} 

S<t 
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2. What does the formula become if we assume only that, instead a constant bit 
rate trunk, the node is a scheduler offering jd as a service curve ? 

Exercise 1.24. Is it true that offering a service curve (3 implies that, during any 
busy period of length t, the amount of service received rate is at least jd{f) ? 

Exercise 1.25. A flow S{t) is constrained by an arrival curve a. The flow is fed into 
a shaper, with shaping curve a. We assume that 

a(s) = min(m + ps, b + rs) 



and 

a(s) = min(Fs, B + Rs) 

We assume that p > r, m < b and P > R. 

The shaper has a fixed buffer size equal to X > m. We require that the buffer 
never overflows. 

1. Assume that B = +oo. Find the smallest of P which guarantees that there is 
no buffer overflow. Let Pq be this value. 

2. We do not assume that B = +oo any more, but we assume that P is set to 
the value Pq computed in the previous question. Find the value {Bo,Ro) of 
{B, R) which guarantees that there is no buffer overflow and minimizes the 
cost function c{B, R) = aB + R, where a is a positive constant. 

What is the maximum virtual delay if{P, B,R) = {Pq, Bq, Rq) ? 

Exercise 1.26. We consider a buffer of size X cells, served at a constant rate of 
c cells per second. We put N identical connections into the buffer; each of the N 
connections is constrained both by GCRA(T\, t\) and GCRA(T 2 , T 2 ). What is the 
maximum value of N which is possible if we want to guarantee that there is no cell 
loss at all ? 

Give the numerical application for T\ = 0.5 ms, t\ = 4.5 ms, T 2 = 5 ms, 
T 2 = 495 ms, c = 10® cells/second, X = 10"^ cells 

Exercise 1.27. We consider a flow defined by its function R{t), with R{t) = the 
number of bits observed since time t = 0. 

1. The flow is fed into a buffer, served at a rate r. Call q{t) the buffer content 
at time t. We do the same assumptions as in the lecture, namely, the buffer is 
large enough, and is initially empty. What is the expression of q{f) assuming 
we know R(t) ? 

We assume now that, unlike what we saw in the lecture, the initial buffer 
content (at time t = OJ is not 0, but some value qo > 0. What is now the 
expression for q(t) ? 
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2. The flow is put into a leaky bucket policer, with rate r and bucket size b. This 
is a policer, not a shaper, so nonconformant bits are discarded. We assume 
that the bucket is large enough, and is initially empty. What is the condition 
on R which ensures that no bit is discarded by the policer ( in other words, 
that the flow is conformant) ? 

We assume now that, unlike what we saw in the lecture, the initial bucket 
content (at time t = Oj w not 0, but some value bo > 0. What is now the 
condition on R which ensures that no bit is discarded by the policer ( in other 
words, that the flow is conformant) ? 

Exercise 1.28. Consider a variable capacity network node, with capacity curve 
M(t). Show that there is one maximum function S*(t) such that for all 0 < s < t, 
we have 

M{t)-M{s) > S*{t-s) 

Show that S* is super-additive. 

Conversely, if a function (3 is super-additive, show that there is a variable ca- 
pacity network node, with capacity curve M (t), such that for all 0 < s < t, we have 
M{t)-M{s) >S*{t-s). 

Show that, with a notable exception, a shaper cannot be modeled as a variable 
capacity node. 

Exercise 1.29. 1. Consider apacketized greedy shaper with shaping curve a (f) = 

rt for t > 0. Assume that L(k) = kM where M is fixed. Assume that the 
input is given by R(t) = lOM for t > 0 and i?(0) = 0. Compute the 
sequence R^i){f) used in the representation of the output of the packetized 
greedy shaper, for i = 1,2,3,.... 

2. Same question if sigma(t) = (rt + 2M)l^t > 0}. 

Exercise 1.30. Consider a source given by the function 

I R(t) = B for i > 0 
\ R(t) = 0 for t < 0 

Thus the flow consists of an instantaneous burst of B bits. 

1. What is the minimum arrival curve for the flow ? 

2. Assume that the flow is served in one node that offers a minimum service curve 
of the rate latency type, with rate r and latency A. What is the maximum delay 
for the last bit of the flow ? 

3. We assume now that the flow goes through a series of two nodes, J\fi and M 2 , 
where Mi offers to the flow a minimum service curve of the rate latency type, 
with rate r, and latency Aj, for i = 1,2. What is the the maximum delay for 
the last bit of the flow through the series of two nodes ? 
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4. With the same assumption as in the previous item, call R \ (t) the function 
describing the flow at the output of node Af\ (thus at the input of node Af 2 )- 
What is the worst case minimum arrival curve for Ri ? 

5. We assume that we insert between Af\ and A /2 a “reformatter” S. The input 
to S is R\ (t). We call R[ (t) the output of S. Thus R[ (t) is now the input to 
A/ 2 - The function of the “ reformatter” S is to delay the flow R\ in order to 
output a flow R'l that is a delayed version of R. In other words, we must have 
R[ (t) = R(t — d) for some d. We assume that the reformatter S is optimal 
in the sense that it chooses the smallest possible d. In the worst case, what is 
this optimal value ofd ? 

6. With the same assumptions as in the previous item, what is the worst case 
end-to-end delay through the series of nodes A/"i,iS, A /2 ? Is the reformatter 
transparent ? 

Exercise 1.31. Let a be a good function. Consider the concatenation of a bit-by-bit 
greedy shaper, with curve a, and an L-packetizer Assume that = 0. Consider 

only inputs that are L-packetized 

1. Is this system a packetized shaper for a ? 

2. Is it a packetized shaper for a + /max ? 

3. Is it a packetized greedy shaper for a + /max ? 

Exercise 1.32. Assume that a is a good function and <t = (Tq + Iuq where uq is the 
step function with a step att = 0. Can we conclude that oq is sub-additive ? 

Exercise 1.33. Is the operator (P^) upper-semi-continuous ? 

Exercise 1.34. 1. Consider the concatenation of an L-packetizer and a network 

element with minimum service curve (3 and maximum service curve 7 . Can we 
say that the combined system offer a minimum service curve (flit) — /max)"*^ 
and a maximum service curve 7 , as in the case where the concatenation would 
be in the reverse order ? . 

2. Consider the concatenation of a GPS node offering a guarantee A ^ , an 
L-packetizer, and a second GPS node offering a guarantee \t 2 - Show that 
the combined system offers a rate-latency service curve with rate R = 
min(ri , r 2 ) and latency E = max 7 Tr 2 ) ' 

Exercise 1.35. Consider a node that offers to a flow R(t) a rate-latency service 
curve (3 = Sr^l- Assume that R(t) is L-packetized, with packet arrival times called 
Ti , T 2 , ... (and is left-continuous, as usual) 

Show that (R ® (3)(t) = miiij’iG[o,i][l?(7i) + (3(t — T,)] (and thus, the inf is 
attained). 
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Exercise 1.36. 1. Assume K connections, each with peak rate p, sustainable 

rate m and burst tolerance b, are offered to a trunk with constant service rate 
P and FIFO buffer of capacity X. Find the conditions on K for the system to 
be loss-free. 

2. If Km = P, what is the condition on X for K connections to be accepted ? 

3. What is the maximum number of connection if p = 2 Mb/s, m = 0.2 Mb/s, 
X = IDMBytes, b = 1Mbyte and P = 0.1, 1, 2 or 10 Mb/s ? 

4. For a fixed buffer size X, draw the acceptance region when K and P are the 
variables. 

Exercise 1.37. Show the formulas giving the expressions for fsiR) and fB(ct)- 

Exercise 1.38. 1. What is the effective bandwith for a connection with p = 2 

Mb/s, m = 0.2 Mb/s, b = 100 Kbytes when D = 1msec, 10 msec, 100 msec. 
Is ? 

2. Plot the effective bandwidth e as a function of the delay constraint in the 
general case of a connection with parameters p, m, b. 

Exercise 1.39. 1. Compute the effective bandwidth for a mix of VBR connec- 

tions 

2. Show how the homogeneous case can be derived from your formula 

3. Assume K connections, each with peak rate p, sustainable rate m and burst 
tolerance b, are offered to a trunk with constant service rate P and FIFO 
buffer of capacity X. Find the conditions on K for the system to be loss-free. 

4. Assume that there are two classes of connections, with Ki connections in class 
i, i = 1,2, offered to a trunk with constant service rate P and FIFO buffer 
of infinite capacity X. The connections are accepted as long as their queuing 
delay does not exceed some value D. Draw the acceptance region, that is, the 
set of{Ki , Kf) that are accepted by CAC2. Is the acceptance region convex ? 
Is the complementary of the acceptance region in the positive orthant convex 
? Does this generalize to more than two classes ? 




Chapter 2 

Application of Network 
Calculus to the Internet 



In this chapter we apply the concepts of Chapter 1 and explain the theoretical un- 
derpinnings of integrated and differentiated services. Integrated services define how 
reservations can be made for flows. We explain in detail how this framework was 
deeply influenced by GPS. In parficular, we will see fhaf if assumes fhat every router 
can be modeled as a node offering a minimum service curve that is a rate-latency 
function. We explain how this is used in a protocol such as RSVP. We also analyze 
the more efficient framework based on service curve scheduling. This allows us to 
address in a simple way the complex issue of schedulability. 

Differentiated services differ radically, in that reservations are made per class of 
service, rather than per flow. We show how the bounding results in Chapter 1 can be 
applied to find delay and backlog bounds. We also introduce the “damper”, which is 
a way of enforcing a maximum service curve, and show how it can radically reduce 
the delay bounds. 



2.1 GPS and Guaranteed Rate Schedulers 

In this section we describe GPS and its derivatives; they form the basis on which the 
Internet guaranteed model was defined. 



2.1.1 Packet Scheduling 

A guaranteed service network offers delay and throughput guarantees to flows, pro- 
vided fhaf the flows satisfy some arrival curve constraints (Section 2.2). This re- 
quires that network nodes implement some form of packet scheduling, also called 
service discipline. Packet scheduling is defined as the function that decides, at every 
buffer inside a network node, the service order for different packets. 
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A simple form of packet scheduling is FIFO: packets are served in the order of 
arrival. The delay bound, and the required buffer, depend on the minimum arrival 
curve of the aggregate flow (Section 1.8 on page 64). If one flow sends a large 
amount of traffic, then the delay increases for all flows, and packet loss may occur. 
Thus FIFO scheduling requires that arrival curve constraints on all flows be strictly 
enforced at all points in the network. Also, with FIFO scheduling, the delay bound 
is the same for all flows. We study FIFO scheduling in more detail in Section 6. 

An alternative [21, 39] is to use per flow queuing, in order to (1) provide iso- 
lation to flows and (2) offer different guarantees. We consider hrst the ideal form 
of per flow queuing called “Generalized Processor Sharing” (GPS) [56], which was 
already mentioned in Chapter 1 . 

2.1.2 GPS and a Practical Implementation (PGPS) 

A GPS node serves several flows in parallel, and has a total output rate equal to c b/s. 
A flow i is allocated a given weight, say Call Ri{t), R*{t) the input and output 
functions for flow i. The guarantee is that at any time t, the service rate offered to 
flow i is 0 is flow i has no backlog (namely, if Ri{t) = R*{t)), and otherwise is 
equal to ^ — j-c, where B{t) is the set of backlogged flows at time t. Thus 

R*(t) = [ ^ — T~^{ieB(s)}ds 

Jo 2^jeB(s) ‘ri 

In the formula, we used the indicator function 1 {expr} > which is equal to 1 if expr 
is true, and 0 otherwise. 

It follows immediately that the GPS node offers to flow i a service curve equal to 
Xnc, with Vi = . It is shown in [57] that a better service curve can be obtained 

for every flow if we know some arrival curve properties for all flows; however the 
simple property is sufficient to understand the integrated service model. 

GPS satisfies the requirement of isolating flows and providing differentiated 
guarantees. We can compute the delay bound and buffer requirements for every 
flow if we know its arrival curve, using the results of Chapter 1 . However, a GPS 
node is a theoretical concept, which is not really implementable, because it relies on 
a fluid model, and assumes that packets are inhnitely divisible. How can we make a 
practical implementation of GPS ? One simple solution would be to use the virtual 
hnish times as we did for the buffered leaky bucket controller in Section 1.7.3: for 
every packet we would compute its hnish time 6 under GPS, then at time 9 present 
the packet to a multiplexer that serves packets at a rate c. Figure 2.1 (left) shows the 
hnish times on an example. It also illustrates the main drawback that this method 
would have: at times 3 and 5, the multiplexer would be idle, whereas at time 6 it 
would have a burst of 5 packets to serve. In particular, such a scheduler would not 
be work conserving. 

This is what motivated researchers to hnd other practical implementations of 
GPS. We study here one such implementation of GPS, called packet by packet 
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generalized processor sharing (POPS) [56]. Other implementations of GPS are dis- 
cussed in Section 2.1.3. 

POPS emulates GPS as follows. There is one FIFO queue per flow. The sched- 
uler handles packets one at a time, until it is fully transmitted, at the system rate 
c. For every packet, we compute the finish time that it would have under GPS (we 
call this the “GPS-finish-time”). Then, whenever a packet is finished transmitting, 
the next packet selected for transmission is the one with the earliest GPS-finish- 
time, among all packets present. Figure 2.1 shows one example. We see that, unlike 
the simple solution discussed earlier, PGPS is work conserving, but does so at the 
expense of maybe scheduling a packet before its finish time under GPS. 




flow 0 
1 
2 

3 

4 

5 



▼ Arrival 
^ Departure 



Figure 2.1: Scheduling with GPS (left) and PGPS (right). Flow 0 has weight 
0.5, flows 1 to 5 have weight 0.1. All packets have the same transmission 
time equal to 1 time unit. 



We can quantify the difference between PGPS and GPS in the following propo- 
sition. In Section 2.1.3, we will see how to derive a service curve property. 

Proposition 2.1.1 ([56]). The finish time for PGPS is at most the finish time of GPS 
plus where c is the total rate and L is the maximum packet size. 

Proof: Call D{n) the finish time of the nth packet for the aggregate input flow 

under PGPS, in the order of departure, and 6{n) under GPS. Call no the number of 
the packet that started the busy period in which packet n departs. Note that PGPS 
and GPS have the same busy periods, since if we observe only the aggregate flows, 
there is no difference between PGPS and GPS. 

There may be some packets that depart before packet n in PGPS, but that 
nonetheless have a later departure time under GPS. Call mo > no the largest packet 
number for which this occurs, if any; otherwise let mo = no — 1. In this proposi- 
tion, we call l(m) the length in bits of packet m. Under PGPS, packet mo started 
service at D{mo) — , which must be earlier than the arrival times of packets 

m = mo + 1, ..., n. Indeed, otherwise, by definition of PGPS, the PGPS scheduler 
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would have scheduled packets m = toq + I, ■■■,n before packet toq- Now let us 
observe the GPS system. Packets m = nio + 1, ...,n depart no later than packet 
n, by definition of rtio', they have arrived after D{mo) — By expressing the 

amount of service in the interval [D{mo) — , 9{n)] we find thus 

l{m) < c (oin) - D{mo) + 

m=mo + l ^ ^ 

Now since packets nio, •••, n are in the same busy period, we have 
D(n) = D(mo) + 

c 

By combining the two equations above we find D{n) < 6{n) + , which shows 

the proposition in the case where toq < ?^o- 

If Too = no — 1, then all packets no , • • • , n depart before packet n under GPS and 
thus the same reasoning shows that 

n 

l(m) < c (0(n) — to) 

m=no 



where to is the beginning of the busy period, and that 



D{n) 



to H 

c 



Thus D{n) < 6{n) in that case. 



□ 



2.1.3 Guaranteed Rate Schedulers 

A large number of practical implementations of GPS, other than PGSP, have been 
proposed in the literature; let us mention: virtual clock scheduling [43], packet by 
packet generalized processor sharing [56] and self-clocked fair queuing [35](see 
also [26]). For a thorough discussion of practical implementations of GPS, see [73, 
26]). These implementations differ in their implementation complexity and in the 
bounds that can be obtained. It is shown in [27] that all of these implementations fit 
in the following framework, called “Guaranteed Rate” (GR) scheduling, which we 
define in a few lines. 

The definition relies on a max-plus representation of the server with constant bit 
rate c, with packet inputs. Such a system can be modeled as the concatenation of the 
greedy shaper with curve and a packetizer, but we use a different representation. 

Proposition 2.1.2 (Max-plus representation of CBR server). Consider a FIFO 
server with constant bit rater, which serves a flow of packets numbered i = 1 , 2 ,.... 
Call Ai,Tl, li the arrival time, departure time, and length in bits for packet i. Ai- 
sume Ai > 0. Then is defined by the following recursion. 
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f T' = 0 

\ T/ = max { Aj, T/_j } + 7 for alH > 1 



(2.1) 



Proof: The FIFO property implies that packet i starts its service at max | A, , T-_ j } . 

□ 

Definition 2.1.1 ([27]). A scheduling policy is of the guaranteed rate type, with rate 
r and delay v for a given flow, if it guarantees that the departure time D j for packet 
i (i > 1) satisfies Di < T^ + v, where T^ is defined by Equation (2.1). 

The variables T- are called “Guaranteed Rate Clocks”. It follows immediately 
from Proposition 2.1.1 that POPS is a GR scheduler for every flow i, with rate 
and delay r; = 

Consider now a GR scheduler with rate r and delay v. Assume the input R{t) 
is L-packetized. Call R'{t) the output. The constant bit rate server, shifted by time 
V, can be expressed as the concatenation of a greedy shaper, shifted by v, and a 
packetizer. We can thus rewrite Definition 2.1.1 as 

R'{t)>P^[{l3r,^^R){t)] (2.2) 

This property is sufficient to obtain bounds such as Corollary 2.1.1 and Equa- 
tion (2.5). However, for the sake of elegance, we give a slightly stronger result, 
at the expense of some technicalities. 

Theorem 2.1.1. Consider a GR scheduler with rate r and delay v. Assume the input 
R(t) is L-packetized. The node is the concatenation of a service curve element, with 
service curve equal to the rate-latency function ldr,v> tmd an L-packetizer. 

The theorem means that we can find some S G (F satisfying R'{t) = P^{S{t)) 
and S > Pr,v ® R- Note that this obviously implies Equation (2.2), but the converse 
is not obvious. 

Proof: Consider the virtual system S whose output S{t) is defined by 

if A-i <t<Di 

then S(t) = min{i?(f), max[L(i — l),L(i) — r{Di — f)]} 

See Eigure 2.2 for an illustration. It follows immediately that R'{f) = P^{S(t)). 
Also consider the virtual system 5° whose output is 

S^{t) = R){t) 

iS° is the constant rate server, delayed by v. Our goal is now to show that S > S^. 

Call D^ the departure time of the last bit of packet i in No (see Eigure 2.2 for 
an example with i = 2). Let u = D^ — D^. The definition of GR scheduling means 
that u > 0. Now since No is a shifted constant rate server, we have: 
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if D° <s<D° then 5°(s) = L(i) - r(D° - s) 

Also <D^-‘f thus - ^) = L{i - 1) and 

if s < — — then S^{s) < L{i — 1) 

It follows that 

if Di-i + u < s < then S^{s) < max[L(i — l),L{i) — r{D^ — s)] (2.4) 

Consider now some t G (Dj_i,Dj] and let s = t + u. If S{t) = R{t), since 
R > S^, we then obviously have S{t) > S^{t). Else, from Equation (2.1), S{t) = 
max[L(i — 1), L{i) — r{Di — t)]. We have — s = Di — t and thus, combining 
with Equation (2.4), we derive that S^{s) < S{t). Now s > t, thus finally S^{t) < 
S(t). □ 




Figure 2.2: Arrival and departure functions for GR scheduling. The virtual 
system output is S{t). 



By applying Theorem 1.7.1, we obtain 
Corollary 2.1.1. A GR scheduler offers a minimum service curve (3^ imax 

Consider now a concatenation of GR schedulers with rates and delay Vm, 
m = 1, By Theorem 1.7.1, we know that for the computation of the end-to- 

end delay, we may ignore the last packetizer. Thus, we can consider the end-to-end 
service curve given by the rate-latency function with rate r = min fm and 

latency Vm + (max constrained by one leaky bucket 

with parameter (p, a), then, by applying Theorem 1.4.2, we find that for p < r, a 
bound on the end-to-end delay is [27] 



M M-1 ^ 

B — ^ ^ Vfii -f (max ^ ^ f 

m— 1 m— 1 



a 

min^ 



(2.5) 



The above equation is a generalization of Equation (1.23). 
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2.2 The Integrated Services Model of the IETF 

2.2.1 The Guaranteed Service 

The Internet supports different reservation principles. Two services are defined: the 
“guaranteed” service, and the “ controlled load” service. They differ in that the for- 
mer provides real guarantees, while the latter provides only approximate guarantees. 
We outline the differences in the rest of this section. In both cases, the principle is 
based on “admission control”, which operates as follows. 

• In order to receive the guaranteed or controlled load service, a flow must first 
perform a reservation during a flow setup phase. 

• A flow must confirm to an arrival curve of the form a{t) = min(M +pt, rt + 
b), which is called the T-SPEC (see Section 1.2.2 on pagel6). The T-SPEC is 
declared during the reservation phase. 

• All routers along the path accept or reject the reservation. With the guaranteed 
service, routers accept the reservation only if they are able to provide a service 
curve guarantee and enough buffer for loss-free operation. The service curve 
is expressed during the reservation phase, as explained below. 

Eor the controlled load service, there is no strict definition of what accepting 
a reservation means. Most likely, it means that the router has an estimation 
module that says that, with good probability, the reservation can be accepted 
and little loss will occur; there is no service curve or delay guarantee. 

In the rest of this chapter we focus on the guaranteed service. Provision of the 
controlled load service relies on models with loss, which are discussed in Chapter 9. 

2.2.2 The Integrated Services Model for Internet Routers 

The reservation phase assumes that all routers can export their characteristics using a 
very simple model. The model is based on the view that an integrated services router 
implements a practical approximation of GPS, such as POPS, or more generally, a 
GR scheduler. We have shown in Section 2. 1 .3 that the service curve offered to a 
flow by a router implementing GR is a rate-latency function, with rate R and latency 
T connected by the relationship 

T=^+D (2.6) 

with C = the maximum packet size for the flow and D = ^, where L is the maxi- 
mum packet size in the router across all flows, and c the total rate of the scheduler. 
This is the model defined for an Internet node [67]. 

Fact 2.2.1. The Integrated Services model for a router is that the service curve 
offered to a flow is always a rate-latency function, with parameters related by a 
relation of the form (2.6). 
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The values of C and D depend on the specific implementation of a router, see 
Corollary 2.1.1 in the case of GR schedulers. Note that a router does not necessar- 
ily implement a scheduling method that approximates GPS. In fact, we discuss in 
Section 2.3 a family of schedulers that has many advantages above GPS. If a ronter 
implements a method that largely differs from GPS, then we must find a service 
curve that lower-bounds the best service curve guarantee offered by the ronter. In 
some cases, this may mean loosing important information abont the router. For ex- 
ample, it is not possible to implement a network offering constant delay to flows 
by means of a system like SCED-t, discussed in Section 2.4.4, with the Integrated 
Services router model. 



2.2.3 Reservation Setup with RSVP 



Consider a flow defined by TSPEC {M, p, r, b), that traverses nodes 1, . . . , A^. Usn- 
ally, nodes 1 and N are end-systems while nodes n for 1 < n < are routers. The 
Integrated Services model assnmes that node n on the path of the flow offers a rate 
latency service curve , and further assumes that T„ has the form 

T — — + D 

where and E>„ are constants that depend on the characteristics of node n. 

The reservation is actually put in place by means of a flow setup procedure such 
as the resource reservation protocol (RSVP). At the end of the procedure, node n on 
the path has allocated to the flow a value > r. This is equivalent to allocating a 
service curve • From Theorem 1.4.6 on page 34, the end-to-end service cnrve 

offered to the flow is the rate-latency function with rate R and latency T given by 

I i? = min„^i.. jv 

1 r = E,t. (fc + D») 

Let C^ot = 'I2n^i and Dn- We can re-write the last equation as 



T = 



%^ + Aot-E^, 

n=l 



with 




The term is called the “local slack” term at node n. 
Erom Proposition 1.4.1 we deduce immediately: 



(2.7) 



(2.8) 



Proposition 2.2.1. If R > r, the bound on the end-to-end delay, under the condi- 
tions described above is 



b-M fp-R\~^ ^ M-\-Ctot , o 

H ;; r L>tot — 2 _^ 

n=l 



R 



p — r 



R 



(2.9) 
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We can now describe the reservation setup with RSVP. Some details of flow 
setup with RSVP are illustrated on Figure 2.3. It shows that two RSVP flows are 
involved: an advertisement (PATH) flow and a reservation (RESV) flow. We describe 
first the point-to-point case. 

• A PATH message is sent by the source; it contains the T-SPEC of the flow 
(source T-SPEC), which is not modified in transit, and another field, the AD- 
SPEC, which is accumulated along the path. At a destination, the ADSPEC 
field contains, among others, the values of Cjoti ^tot used in Equation 2.9. 
PATH messages do not cause any reservation to be made. 

• RESV messages are sent by the destination and cause the actual reservations 
to be made. They follow the reverse path marked by PATH messages. The 
RESV message contains a value, R', (as part of the so-called R-SPEC), which 
is a lower bound on the rate parameters that routers along the path will 
have to reserve. The value of i?' is determined by the destination based on the 
end-to-end delay objective , following the procedure described below. It 
is normally not changed by the intermediate nodes. 



□ 



□ 



R e c e iv e r 



1 . path m essage 


2 . path m essage 


3 .path m essage 


2K 
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4K 


Receiver TSPEC = 
2K,10M b/s,512kb/s,2 
'll^SPEC = (622 kb/s) 


lleceiver TSPEC- 
2K,10M b/s,512kb/s, 
ll^SPEC = (622 kb/s) 


Receiver TSPEC = 
2K,10M b/s,512kb/s,: 
R-SPEC = (622 kb/s) 



Figure 2.3: Setup of Reservations, showing the PATH and RESV flows 



Define function / by 






b-M 

R' 



p-R' 
p — r 



Af + Cfot 

+ ^ 7 ^ + Dtot 



In other words, / is the function that defines the end-to-end delay bound, assuming 
all nodes along the path would reserve = R' . The destination computes R' 
as the smallest value > r for which f{R') < dobj- Such a value exists only if 

^tot ^ ^obj * 
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In the figure, the destination requires a delay variation objective of 600 ms, 
which imposes a minimum value of R' =622 kb/s. The value of R' is sent to the 
next upstream node in the R-SPEC field of the PATH message. The intermediate 
nodes do not know the complete values Cjot T>tot’ they know the total 

delay variation objective. Consider the simple case where all intermediate nodes are 
true POPS schedulers. Node n simply checks whether it is able to reserve Rn = R' 
to the flow; this involves verifying that the sum of reserved rates is less than the 
scheduler total rate, and that there is enough buffer available (see below). If so, it 
passes the RESV message upstream, up to the destination if all intermediate nodes 
accept the reservation. If the reservation is rejected, then the node discards it and 
normally informs the source. In this simple case, all nodes should set their rate to 
Rn = R' thus R = R', and Equation (2.9) guarantees that the end-to-end delay 
bound is guaranteed. 

In practice, there is a small additional element (use of the slack term), due to the 
fact that the designers of RS VP also wanted to support other schedulers. It works as 
follows. 



□ 



□ 



R e c e iv e r 



1 . path m essage 


2 . path m essage 


3 . path tn essage 


2K 


TSPEC- * 

2K.10M b/s,5 12kb/s,: 
AdSpec= 0 


^gnder TSPEC- * 

2K,10M b/s,5 12kb/s, 
AdSpec= (10 .2s/kb/s 
0 .05s) 


^^^nder TSPEC = 

2K,10M b/s,5 12kb/s,3 
AdSpec= (51 .2 , 0 .1 ) 




4 . B requests guaranteed QoS 
reservation w ith delay variation 
l.Os; B reserves 512kb/s 


6 . resv m essage 


6 . resv m essage 


5 . resv m essage 


4K 


Receiver TSPEC- 
2K,10M b/s,5 12kb/s,2 
•k^SPEC = (512kb/s , S: 
"^0 .2 8 8s) 


lleceiver TSPEC- 
2K,10M b/s,5 12kb/s, 
“k^SPEC = (512kb/s , S 
0 .288s) 


Receiver TSPEC = 
2K.10M b/s,5 12kb/s,: 
R-SPEC = (512 kb/s, 
S=0 .188s) 



Eigure 2.4: Use of the slack term 



There is another term in the R-SPEC, called the slack term. Its use is illustrated 
on Figure 2.4. In the figure, we see that the end-to-end delay variation requirement, 
set by the destination, is 1000 ms. In that case, the destination reserves the minimum 
rate, namely, 512 kb/s. Even so, the delay variation objective D obj is larger than the 
bound Dmax given by Formula (2.9). The difference D obj — Dmax is written in the 
slack term S and passed to the upstream node in the RESV message. The upstream 
node is not able to compute Formula (2.9) because it does not have the value of the 
end-to-end parameters. However, it can use the slack term to increase its internal 
delay objective, on top of what it had advertised. For example, a guaranteed rate 
scheduler may increase its value of v (Theorem 2.1.1) and thus reduce the internal 
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resources required to perform the reservation. The figure shows that R1 reduces the 
slack term by 100 ms. This is equivalent to increasing the Died parameter by 100ms, 
but without modifying the advertised Dfot. 

The delays considered here are the total (fixed plus variable) delays. RSVP also 
contains a field used for advertising the fixed delay part, which can be used to com- 
pute the end-to-end fixed delay. The variable part of the delay (called delay jitter) is 
then obtained by subtraction. 



2.2.4 A Flow Setup Algorithm 

There are many different ways for nodes to decide which parameter they should 
allocate. We present here one possible algorithm. A destination computes the worst 
case delay variation, obtained if all nodes reserve the sustainable rate r. If the result- 
ing delay variation is acceptable, then the destination sets R = r and the resulting 
slack may be used by intermediate nodes to add a local delay on top of their ad- 
vertised delay variation defined by C and D. Otherwise, the destination sets R to 
the minimum value Rmin that supports the end-to-end delay variation objective and 
sets the slack to 0. As a result, all nodes along the path have to reserve Rmin- As 
in the previous cases, nodes may allocate a rate larger than the value of R they pass 
upstream, as a means to reduce their buffer requirement. 

Definition 2.2.1 (A Flow Setup Algorithm). • At a destination system I, com- 
pute 

Dmax = /T(r) -I h Dfot 

r 

If Dokj > Dmax then assign to the flow a rate Ri = r and an additional 
delay variation dj < Dgbj — Dmax! set Sj = Dgbj — Dmax ~ di ^nd send 
reservation request Rj, Si to station 7 — 1. 

Else (Dobj < Dmax) flnd the minimum Rmin such that friRmin) + ^*°* < 
Dobj — Dfot, if it exists. Send reservation request Rj = Rmin ,Si = 0 to 
station I — 1. If Rmin does not exist, reject the reservation or increase the 
delay variation objective Dobj- 

• At an intermediate system i: receive from i-\-\ a reservation request Ri^i , Sj-i-i. 

If Si = 0, then perform reservation for rate 7?j+i and if successful, send 
reservation request Ri = Ri+i, 5, = 0 to station i — 1. 

Else (Si > 0), perform a reservation for rate Ri^i with some additional delay 
variation di < if successful, send reservation request Ri = 7?,_|_i,5j = 

— di to station i — 1. 



The algorithm ensures a constant reservation rate. It is easy to check that the end 
to end delay variation is bounded by Dobj- 
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2.2.5 Multicast Flows 

Consider now a multicast situation. A source S sends to a number of destinations, 
along a multicast tree. PATH messages are forwarded along the tree, they are dupli- 
cated at splitting points; at the same points, RESV messages are merged. Consider 
such a point, call it node i, and assume it receives reservation requests for the same 
T-SPEC but with respective parameters and The node performs 

reservations internally, using the semantics of algorithm 3. Then it has to merge the 
reservation requests it will send to node i — 1. Merging uses the following rules: 

R-SPEC Merging Rules The merged reservation R, S is given by 

R = max(i?', R”) 

S = minis', S”) 

Let us consider now a tree where algorithm 3 is applied. We want to show that 
the end-to-end delay bounds at all destinations are respected. 

The rate along the path from a destination to a source cannot decrease with this 
algorithm. Thus the minimum rate along the tree towards the destination is the rate 
set at the destination, which proves the result. 

A few more features of RSVP are: 

• states in nodes need to be refreshed; if they are not refreshed, the reservation 
is released (“soft states”). 

• routing is not coordinated with the reservation of the flow 

We have so far looked only at the delay constraints. Buffer requirements can be 
computed using the values in Proposition 1.4.1. 

2.2.6 Flow Setup with ATM 

With ATM, there are the following differences: 

• The path is determined at the flow setup time only. Different connections may 
follow different routes depending on their requirements, and once setup, a 
connection always uses the same path. 

• With standard ATM signaling, connection setup is initiated at the source and 
is confirmed by the destination and all intermediate systems. 

2.3 Schedulability 

So far, we have considered one flow in isolation and assumed that a node is able 
to offer some scheduling, or service curve guarantee. In this section we address the 
global problem of resource allocation. 
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When a node performs a reservation, it is necessary to check whether local re- 
sources are sufficient. In general, the method for this consists in breaking the node 
down into a network of building blocks such as schedulers, shapers, and delay ele- 
ments. There are mainly two resources to account for: bit rate (called “bandwidth”) 
and buffer. The main difficulty is the allocation of bit rate. Following [31], we will 
see in this section that allocating a rate amounts to allocating a service curve. It is 
also equivalent to the concept of schedulability. 

Consider the simple case of a POPS scheduler, with outgoing rate C. If we want 
to allocate rate r, to flow i, for every i, then we can allocate to flow i the GPS weight 
(pi = ^. Assume that 

Y,ri<C ( 2 . 10 ) 

i 

Then we know from Proposition 2.1.1 and Corollary 2.1.1 that every flow i is guar- 
anteed the rate-latency service curve with rate r , and latency In other words, the 
schedulability condition for POPS is simply Equation (2.10). However, we will see 
now that a schedulability conditions are not always as simple. Note also that the 
end-to-end delay depends not only on the service curve allocated to the flow, but 
also on its arrival curve constraints. 

Many schedulers have been proposed, and some of them do not fit in the GR 
framework. The most general framework in the context of guaranteed service is 
given by SCED (Service Curve Earliest Deadline hrst) [31], which we describe now. 
We give the theory for constant size packets and slotted time; some aspects of the 
general theory for variable length packets are known [10], some others remain to be 
done. We assume without loss of generality that every packet is of size 1 data unit. 

2.3.1 EDF Schedulers 

As the name indicates, SCED is based on the concept of Earliest Deadline First 
(EDF) scheduler. An EDF scheduler assigns a deadline Df to the nth packet of flow 
i, according to some method. We assume that deadlines are wide-sense increasing 
within a flow. At every time slot, the scheduler picks at one of the packets with the 
smallest deadline among all packets present. There is a wide variety of methods 
for computing deadlines. The “delay based” schedulers [49] set D " = A" -|- d, 
where A" is the arrival time for the nth packet for flow i, and d, is the delay budget 
allocated to flow i.lfdi is independent of i, then we have a FIFO scheduler. We will 
see that those are special cases of SCED, which we view as a very general method 
for computing deadlines. 

An EDF scheduler is work conserving, that is, it cannot be idle if there is at 
least one packet present in the system. A consequence of this is that packets from 
different flows are not necessarily served in the order of their deadlines. Consider for 
example a delay based scheduler, and assume that flow 1 has a Irage delay budget d i , 
while flow 2 has a small delay budget d 2 . It may be that a packet of flow 1 arriving 
at t\ is served before a packet of flow 2 arriving at f 2 , even though the deadline of 
packet 1, t\ + di is larger than the deadline of packet 2. 
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We will now derive a general schedulability criterion for EDF schedulers. Call 
Ri {t), t G N, the arrival function for flow i. Call Zi{t) the number of packets of 
flow i that have deadlines < t. For example, for a delay based scheduler, Zi{t) = 
Ri{t — di). The following is a modified version of [10]. 

Proposition 2.3.1. Consider an EDF scheduler with I flows and outgoing rate C. 
A necessary condition for all packets to be served within their deadlines is 

I 

for all s < f : ^ Zflf) — Rfls) < C(t — s) (2.11) 



A sufficient condition is 



I 

for all s < f : — Ri{s)]~^ < C(t — s) (2.12) 

Proof: We first prove the necessary condition. Call R\ the output for flow i. Since 

the scheduler is work conserving, we have = Ac ® Ri)- Now 

i?' > Zi by hypothesis. Thus 

I I 

inf C'(f-s) + Vi?i(s) 
which is equivalent to Equation (2. 1 1) 

Now we prove the sufficient condition, by contradiction. Assume that at some 
t a packet with deadline t is not yet served. In time slot t, the packet served has a 
deadline < t, otherwise our packet would have been chosen instead. Define s o such 
that the time interval [sq + l,f] is the maximum time interval ending at t that is 
within a busy period and for which all packets served have deadlines < t. 

Now call S the set of flows that have a packet with deadline < t present in the 
system at some point in the interval [sq + 1, f]- We show that if 

if i e iS then R'flso) = Ri{so) (2.13) 

that is, flow i is not backlogged at the end of time slot sq- Indeed, if sq + 1 is the 
beginning of the busy period, then the property is true for any flow. Otherwise, we 
proceed by contradiction. Assume that i G S and that i would have some backlog at 
the end of time slot sq- At time sq some packet with deadline > t was served; thus 
the deadline of all packets remaining in the queue at the end of time slot s o must 
have a deadline > t. Since deadlines are assumed wide-sense increasing within a 
flow, all deadlines of flow i packets that are in the queue at time s o > or will arrive 
later, have deadline > t, which contradicts that i G S. 

Further, it follows from the last argument that if i € 5, then all packets served 
before or at t must have a deadline < t. Thus 
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if i e iS then < Zi{t) 

Now since there is at least one packet with deadline < t not served at t, the 
previous inequality is strict for at least one i in S. Thus 

(2.14) 

ies ies 

Observe that all packets served in [sq + 1, f] rnust be from flows in S. Thus 
/ 

- i?'(so)) = 

i—1 iES 

Combining with Equation (2.13) and Equation (2.14) gives 

/ 

J2(R'M - i?'(so)) < - Ri(so)) 

i—1 iES 

Now [so + l,t] is entirely in a busy period thus — i?'(so)) = C(t — sq); 

thus 

/ 

C{t-so) < Y,iZi{t)-Ri{so)) = Y.{Zi{t)-Ri{so))+ < Y,iZi{t)-Ri{so))+ 

iES iES i—1 

which contradicts Equation (2.12). □ 

A consequence of the proposition that if a set of flows is schedulable for some 
deadline allocation algorithm, then it is also schedulable for any other deadline al- 
location method that produces later or equal deadlines. Other consequences, of im- 
mediate practical importance, are drawn in the next section. 

2.3.2 SCED Schedulers [65] 

Given, for all i, a function /3,, SCED defines a deadline allocation algorithm that 
guarantees, under some conditions, that flow i does have /3, as a minimum service 
curve'. Roughly speaking, SCED sets Zi(t), the number of packets with deadline 
up to t, to {Ri ® Pi){t). 

Definition 2.3.1 (SCED). Call Af the arrival time for packet n of flow i. Define 
functions Rf by: 

R?(t) = inf [Ri{s) +I3i{t- s)] 

«G[oAr] 

With SCED, the deadline for packet n of flow i is defined by 

D” = (i?”)-'(n) = min{f € N : > n) 

Function /3i is called the ‘'target service curve” for flow i. 

^We use the original work in [65], which is called there “SCED-B”. For simplicity, we call it SCED. 
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Function Rf is similar to the min-plus convolution Ri® fli, but the minimum is 
computed over all times up to . This allows to compute a packet deadline as soon 
as the packet arrives; thus SCED can be implemented in real time. The deadline 
is obtained by applying the pseudo-inverse of Rf, as illustrated on Figure 2.5. If 
Pi = ^di > then it is easy to see that Df = Af + di, namely, SCED is the delay based 
scheduler in that case. The following proposition is the main property of SCED. It 




Figure 2.5: Definition of SCED. Packet n of flow i arrives at time Af. Its dead- 
line is Df. 

shows that SCED implements a deadline allocation method based on service curves. 

Proposition 2.3.2. For the SCED scheduler, the number of packets with deadline 
< t is given by Zft) = [(i?, ® A)(f)J 

Proof: We drop index i in this demonstration. First, we show that Z{t) > [(i? (8) 

P){t)\ . Let n = [{R (8 P){t)\ ■ Since i? 8 /? < i? and R takes integer values, we 
must have R{t) > n and thus A” < t. Now R'^(f) > (i? 8 P){t) thus 

> {R®P){f) > n 

By definition of SCED, 79" this implies that 79" < t which is equivalent to Z(f) > 
n. 

Conversely, for some fixed but arbitrary t, let now n = Z(f). Packet n has a 
deadline < t, which implies that A" < t and for all s € [0, A^] : 

R{s) + Pit - s) > n (2.15) 

Now for s e we have R{s) > n thus R{s) + P{t — s) > n. Thus Equa- 
tion (2.15) is true for all s G [0, t], which means that (7? 8 P)(t) > n. □ 
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Theorem 2.3.1 (Schedulability of SCED, ATM). Consider a SCED scheduler with 
I flows, total outgoing rate C, and target service curve (3 i for flow i. 

1- If 

I 

!3i(t) < Ct for alH > 0 (2.16) 

then every packet is served before or at its deadline and every flow i receives 
\_f3i\ as a service curve. 

2. Assume that in addition we know that every flow i is constrained by an arrival 
curve ai. If 

I 

^(q!, ® A)(i) < Ct for alH > 0 (2.17) 

then the same conclusion holds 



Proof: 

1. Proposition 2.3.2 implies that Zflf) < Rfls) + flflt — s) for 0 < s < f. Thus 
Zi(t) — Ri(s) < (3ft — s). Now 0 < (3i{t — s) thus 

[Zflf) - i?*(s)]+ = max[Zj(f) - 0] < Pflt - s) 

By hypothesis, ~ s) < C{t — s) thus by application of Proposi- 

tion 2.3.1, we know that every packet is served before or at its deadline. Thus 
R\ > Zi and from Proposition 2.3.2: 

R'i>Zi = i(3i ® Ri\ 

Now Ri takes only integer values thus [/3, ® Ri\ = \_(3i\ ® Ri. 

2. By hypothesis, R^ = Ri thus Z, = [a, ® (3i® i?,J and we can apply 

the same argument, with (3 i instead of /),. □ 

Schedulability of delay based schedulers A delay based scheduler assigns a de- 
lay objective d, to all packets of flow i. A direct application of Theorem 2.3.1 gives 
the following schedulability condition. 

Theorem 2.3.2 ([49]). Consider a delay based scheduler that serves I flows, with 
delay di assigned to flow i. All packets have the same size and time is slotted. Assume 
flow i is ai-smooth, where ai is sub-additive. Call C the total outgoing bit rate. Any 
mix of flows satisfying these assumptions is schedulable if 

aflt - di) < Ct 



Ifaflf) € N then the condition is necessary. 
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Proof: A delay based scheduler is a special case of SCED, with target service 

curve Pi = . This shows that the condition in the theorem is sufficient. Con- 

versely, consider the greedy flows given by Rpt) = apt). This is possible because 
ai is assumed to be sub-additive. Flow i?, must be schedulable, thus the output R\ 
satisfies R[{t) > ai{i — dp. Now R[{t) < ct, which proves that the condition 
must hold. □ 

It is shown in [49] that a delay based scheduler has the largest schedulability 
region among all schedulers, given arrival curves and delay budgets for every flow. 
Note however that in a network setting, we are interested in the end-to-end delay 
bound, and we know (Section 1.4.3) that it is generally less than the sum of per hop 
bounds. 

The schedulability of delay based schedulers requires that an arrival curve is 
known and enforced at every node in the network. Because arrival curves are modi- 
fied by network nodes, this motivates the principle of Rate Controlled Service Dis- 
ciplines (RCSDs) [38, 74, 26], which implement in every node a packet shaper fol- 
lowed by a delay based scheduler. The packet shaper guarantees that an arrival curve 
is known for every flow. Note that such a combination is not work conserving. 

Because of the ’’pay bursts only once” phenomenon, RCSD might provide end- 
to-end delay bounds that are worse than guaranteed rate schedulers. However, it is 
possible to avoid this by aggressively reshaping flows in every node, which, from 
Theorem 2.3.2, allows us to set smaller deadlines. If the arrival curves constraints 
on all flows are defined by a single leaky bucket, then it is shown in [59, 58] that 
one should reshape a flow to its sustained rate at every node in order to achieve the 
same end-to-end delay bounds as GR schedulers would. 

Schedulability of GR Schedulers Consider the family of GR schedulers, applied 
to the ATM case. We cannot give a general schedulability condition, since the fact 
that a scheduler is of the GR type does not tell us exactly how the scheduler operates. 
However, we show that for any rate r and delay v we can implement a GR scheduler 
with SCED. 

Theorem 2.3.3 (GR scheduler as SCED, ATM case). Consider the SCED sched- 
uler with I flows and outgoing rate C. Let the target service curve for flow i be 
equal to the rate-latency service curve with rate r , and latency Vi. If 

I 

then the scheduler is a GR scheduler for each flow i, with rate r , and delay Vi. 

Proof: From Proposition 2.3.2: 

Zpf) = [(i?j ® \n)(t - vp\ 

thus Zi is the output of the constant rate server, with rate r,, delayed by Vi. Now 
from Theorem 2.3.1 the condition in the theorem guarantees that i? ' > Zi, thus the 
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delay for any packet of flow i is bounded by the delay of the constant rate server 
with rate r , , plus Vi . □ 

Note the fundamental difference between rate based and delay based schedulers. 
For the former, schedulability is a condition on the sum of the rates; it is independent 
of the input traffic. In contrast, for delay based schedulers, schedulability imposes a 
condition on the arrival curves. Note however that in order to obtain a delay bound, 
we need some arrival curves, even with delay based schedulers. 



Better than Delay Based scheduler A scheduler need not be either rate based or 
delay based. Rate based schedulers suffer from coupling between delay objective 
and rate allocation: if we want a low delay, we may be forced to allocate a large 
rate, which because of Theorem 2.3.3 will reduce the number of flows than can be 
scheduled. Delay based schedulers avoid this drawback, but they require that flows 
be reshaped at every hop. Now, with clever use of SCED, it is possible to obtain 
the benefits of delay based schedulers without paying the price of implementing 
shapers. 

Assume that for every flow i we know an arrival curve a , and we wish to obtain 
an end-to-end delay bound d,. Then the smallest network service curve that should 
be allocated to the flow is a, ® (the proof is easy and left to the reader). Thus a 
good thing to do is to build a scheduler by allocating to flow i the target service 
curve ai ® . The schedulability condition is the same as with a delay based 

scheduler, however, there is a significant difference: the service curve is guaranteed 
even if some flows are not conforming to their arrival curves. More precisely, if 
some flows do not conform to the arrival curve constraint, then the service curve is 
still guaranteed, but the delay bound is not. 

This observation can be exploited to allocate service curves in a more flexible 
way than what is done in Section 2.2 [17]. Assume flow i uses the sequence of 
nodes m = 1, ...,M. Every node receives a part d™ of the delay budget d,, with 
Then it is sufficient that every node implements SCED with a 
target service curve /3™ = ® a, for flow i. The schedulability condition at node 

m is 

aj{t - df) < Cmt 

jeE^ 

where is the set of flows scheduled at node m and Cm is the outgoing rate of 
node TO. If it is satisfied, then flow i receives a, ^ 64 . as end-to-end service curve 
and therefore has a delay bounded by d,. The schedulability condition is the same 
as if we had implemented at node to the combination of a delay based scheduler 
with delay budget d™ ^ and a reshaper with shaping curve a , ; but we do not have to 
implement a reshaper. In particular, the delay bound for flow i at node to is larger 
than d™; we find again the fact that the end-to-end delay bound is less than the sum 
of individual bounds. 

In [65], it is explained how to allocate a service curves /3™ to every network 
element to on the path of the flow, such that (i] ® ® ... = a, ® <5*, in order to 
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obtain a large schedulability set. This generalizes and improves the schedulability 
region of RCSD. 

Extension to variable length packets We can extend the previous results to vari- 
able length packets; we follow the ideas in [10]. The first step is to consider a ficti- 
tious preemptive EDF scheduler (system I), that allocates a deadline to every bit. We 
define Zf (t) as before, as the number of bits whose deadline is < f . A preemptive 
EDF scheduler serves the bits present in the system in order of their deadlines. It 
is preemptive (and fictitious) in that packets are not delivered entirely, but, in con- 
trast, are likely to be interleaved. The results in the previous sections apply with no 
change to this system. 

The second step is to modify system 1 by allocating to every bit a deadline 
equal to the deadline of the last bit in the packet. Call it system II. We have 
Zf^{t) = P^' {Zf (t)) where P^' is the cumulative packet length (Section 1.7) for 
flow i. From the remarks following Proposition 2.3.1, it follows that if system I is 
schedulable, then so is system II. System II is made of a preemptive EDF scheduler 
followed by a packetizer. 

The third step consists in defining “packet-EDF” scheduler (system III); this 
is derived from system II in the same way as PGSP is from GPS. More precisely, 
the packet EDF scheduler picks the next packet to serve among packets present in 
the system with minimum deadline. Then, when a packet is being served, it is not 
interrupted. We also say that system III is the non-preemptive EDF scheduler. Then 
the departure time of any packet in system III is bounded by its departure time in 
system II plus where (max is the maximum packet size across all flows and C 
is the total outgoing rate. The proof is similar to Proposition 2.1.1 and is left to the 
reader (it can also be found in [10]). 

We can apply the three steps above to a SCED scheduler with variable size 
packets, called “Packet-SCED”. 

Definition 2.3.2 (Packet SCED). A PSCED schedulers is a non-premptive EDF 
schedulers, where deadlines are allocated as follows. Call Af the arrival time for 
packet n of flow i. Define functions Rf by: 

Rf (t) = inf [Ri {s)+Pi{t-s)] 
se[o,A"] 

With PSCED, the deadline for packet n of flow i is defined by 

D? = {R?)-\Li{n)) = mm{t G N : > {Lfri))} 

where Li is the cumulative packet length for flow i. Function fli is called the “target 
service curve” for flow i. 

The following proposition follows from the discussion above. 

Proposition 2.3.3. [10] Consider a PSCED scheduler with I flows, total outgoing 
rate C, and target service curve /), for flow i. Call (^ax maximum packet size 
for flow i and let (max = maxj (^^^^. 
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1- If 

I 

y~^ j3i(t) < Ct for alH > 0 (2.18) 

then every packet is served before or at its deadline plus A bound on 
packet delay is h{ai, + Moreover, every flowi receives [A(i— (max)“ 

as a service curve. 

2. Assume that, in addition, we know that every flow i is constrained by an ar- 
rival curve ai. If 



I 

^(q!, ® < Ct for alH > 0 (2.19) 

then the same conclusion holds. 



Note that the hrst part of the conclusion means that the maximum packet delay 
can be computed by assuming that flow i would receive /3, (not (iflt — (^ax)) ^ 

service curve, and adding . 



Proof: It follows from the three steps above that the PSCED scheduler can be 

broken down into a preemptive EDF scheduler, followed by a packetizer, followed 
by a delay element. The rest follows from the properties of packetizers and Theo- 
rem 2.3.1. 



2.3.3 Buffer Requirements 

As we mentioned at the beginning of this section, buffer requirements have to be 
computed in order to accept a reservation. The condition is simply Xi < X 
where A, is the buffer required by flow i at this network element, and X is the 
total buffer allocated to the class of service. The computation of A , is based on 
Theorem 1.4.1; it requires computing an arrival curve of every flow as it reaches 
the node. This is done using Theorem 1 .4.2 and the flow setup algorithm, such as in 
Definition 2.2.1. 

It is often advantageous to reshape flows at every node. Indeed, in the absence 
of reshaping, burstiness is increased linearly in the number of hops. But we know 
that reshaping to an initial constraint does not modify the end-to-end delay bound 
and does not increase the buffer requirement at the node where it is implemented. 
If reshaping is implemented per flow, then the burstiness remains the same at every 
node. 
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2.4 Application to Differentiated Services 

2.4.1 Differentiated Services 

In addition to the reservation based services we have studied in Section 2.2, the 
Internet also proposes differentiated services [6]. The major goal of differentiated 
services is to provide some form of better service while avoiding per flow state 
information as is required by integrated services. The idea to achieve this is based 
on the following principles. 

• Traffic classes are defined; inside a network, all traffic belonging to the same 
class is treated as one single aggregate flow. 

• At the network edge, individual flows (called “micro-flows”) are assumed to 
conform to some arrival curve, as with integrated services. 

If the aggregate flows receive appropriate service curves in the network, and if the 
total traffic on every aggregafe flow is nof too large, then we should expect some 
bounds on delay and loss. The condition on microflows is key to ensuring that the 
total aggregate traffic remains within some arrival curve constraints. A major diffi- 
culty however, as we will see, is to derive bounds for individual flows from charac- 
teristics of an aggregate. 

Differentiated services is a framework that includes a number of different ser- 
vices. The main two services defined today are expedited forwarding (EF)[37] and 
assured forwarding (AF)[34j. The goal of FF is to provide to an aggregate some 
hard delay guarantees, and no loss. The goal of AF is to separate traffic befween 
a small number of classes (4); inside each class, fhree levels of drop priorities are 
defined. One of fhe AF classes could be used fo provide a low delay service with no 
loss, similar to FF. 

In this chapter, we do not study in detail the implementations of FF or AF. 
We focus on the fundamental issue of how aggregate scheduling impacts delay and 
throughput guarantees. In the rest of this section, we use the network model shown 
on Figure 2.6. Our problem is to find bounds for end-fo-end delay jitter on one hand, 
for backlog at all nodes on the other hand, under the assumptions mentioned above. 
Delay jitter is is the difference between maximum and minimum delay; its value 
decides the size if playout buffers (Section 1.1.3). 

2.4.2 A Bounding Method for Aggregate Scheduling 

Consider a node m. If all traffic entering the node is access traffic only (no transit 
traffic) fhen an arrival curve for fhe aggregate traffic enfering fhe node is XljGf 
where is the set of flows fhat enter the network at this node. The aggregate flow 
is guaranteed a service curve Sm, thus, from Section 1.4.1, the total required buffer 
is bounded by v{Y.ieF^ a*, 5^), and the delay by v{Y.ieF^ 'S'™). 

In order to use Section 1.4.1 we need now to find an arrival curve for fhe Iraffic 
exiting node to. This in general requires more assumptions on the aggregate sched- 
uler. We examine in Chapter 6 the properties of FIFO schedulers and will find fhere 
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R4 R5 k. 




Figure 2.6; Network Model for Differentiated Services. Microflows Ri,...Re 
are individually shaped and each conform to some arrival curve a^. At node 
1, microflows i?i to R 3 are handled as one aggregate flow, which receives a 
service curve 5i. Upon leaving node 1, the different microflows take different 
paths and become part of other aggregates at other nodes. 



some strong, but complex results. We have a simple result if the service curves are 
strict, as defined in Section 1.3.1; this applies to many practical cases, in particular 
to all variable capacity nodes and priority schedulers. 

Theorem 2.4.1. Consider a node serving two flows in an aggregate manner. Assume 
the aggregate is guaranteed a strict service curve fl. Assume also that the second 
flow is a 2 -smooth. Then the first flow is guaranteed a minimum service curve 
given by 

= [/3(f) - a2{t)] + 



Proof: Call Ri and i?' the input and output, for i = 1,2. Fix some arbitrary t and 

call s the beginning of the busy period (for the aggregate). Thus 

3^1 (f) "h 3^2 (f) — (s) + R 2 (s) + /3(f ~ s) 

Now 

i? 2 (f) < R 2 (t) < i? 2 (s) + a 2 (t - s) 

thus 

R'l (t) > Ri (s) + I3(t - s) - a 2 (t - s) (2.20) 

Since s is the beginning of the busy period, we have R [ (s) + J ?2 (s) = Ri (s) +R 2 (s) 
and therefore R[ (s) = Ri (s) . Thus 

R'i(t) > R'i(s) = Ri(s) 

combining this with Equation (2.20), we find 



R'i(t) > Ri(s) + [fl{t - s) - a 2 {t - s)]"*" 
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which proves the service curve property. □ 

If the arrival curves are affine, then the following corollary of Theorem 2.4. 1 
expresses the burstiness increase due to multiplexing. 

Corollary 2.4.1. Consider a node serving two flows in an aggregate manner. As- 
sume the aggregate is guaranteed a strict service curve I3 r^t- Assume also that flow 
i is constrained by one leaky bucket with parameters (r , , 6,). The output of the first 
flow is constrained by a leaky bucket with parameters (ri , 6 j) with 



h 



* 

1 



hi + riT + n 



62 + T 2 T 
R- r2 



Note that the burstiness increase contains a term riT that is found even if there 
is no multiplexing; the second term ri comes from multiplexing with flow 

2. Note also that if we further assume that the node is FIFO, then we have a better 
bound (Chapter 6 ). 

Proof: From Theorem 2.4.1, the first flow is guaranteed a service curve (i r^t’ 

with R' = R — r 2 and T' = ^^ 3 ^. The result follows from a direct application of 
Theorem 1.4.3. □ 

We could think of using Corollary 2.4.1 to analyze the general network in Fig- 
ure 2.6. However, this would work only in a feed-forward network, namely, a net- 
work where we can number nodes such that the sequence of node numbers followed 
by any flow is always increasing. Equivalently, this means that there is no cycle for 
the relation between nodes dehned by to to' iff among the flows entering to', at 
least one exits to. In a feed-forward, we can recursively apply Corollary 2.4.1 and 
find arrival curve constraints at all nodes, thus backlog and delay bounds. 

However, feed-forward topologies are very restrictive, and most networks are 
not feedforward. We show now an analysis method that can be applied to non- 
feedforward networks. 

Assumptions and notation We consider in the rest of this subsection the case 
where all fresh arrival curves a, have the form 

tT-i — Tvi ^bi 

In other words, the constraints are expressed by a single leaky bucket. To simplify 
the notation, we consider that the nodes are are constant rate, work conserving 
servers, with rate Cm for node to. We assume that there is no loop on the path 
followed by any flow. We use the notation to € i or i 9 to to express that node to is 
on the path of flow i. 

We call “utilization factor”at link I the ratio 

y' ■=. Ti 

,, * 

' “ Cl 
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We also assume that the network is “sub-critical”, that is, for all node m: 



vi <1 



(2.21) 



We will illustrate the method on a specific example, shown on Figure 2.7. 

In a non-feedforward network, we cannot directly apply Corollary 2.4.1. How- 
ever, we can use the following trick, called the “time stopping method”, which was 
introduced in [19]. The method has two steps. First, we assume that there is a finite 
burstiness bound for all flows; using Corollary 2.4.1 we obtain some equations for 
computing these bounds. Second, we use the same equations to show that, under 
some conditions, finite bounds exist. 



Figure 2.7: A simple example with aggregate scheduling, used to illustrate 
the bounding method. There are three nodes numbered 0, 1, 2 and six flows, 
numbered 0, ..., 5. For i = 0, 1, 2, the path of flow i is i, {i + 1) mod 3, {i + 2) 
mod 3 and the path of flow i -1-3 is i, (i 4-2) mod 3, (i 4-1) mod 3. The fresh 
arrival curve is the same for all flows, and is given by a, = 7^,^- All nodes 
are constant rate, work conserving servers, with rate C. The utilization factor 
at all nodes is 6 ^. 



First step: inequations for the bounds For any flow i and any node m G i, define 
6™ as the maximum backlog that this flow would generate in a constant rate server 
with rate r j . By convention, the fresh inputs are considered as the outputs of a virtual 
node numbered —1. In this first step, we assume that 6™ is finite for all i and m G i. 

From Lemma 1.2.2 on Page 12, fo™ is the smallest value such that the output of 
node TO is constrained by the leaky bucket with parameters (r ,, 6™). By applying 
Corollary 2.4.1 we find that for all i and m G i: 



Node 2 




Node 1 




(2.22) 



where predj(TO) is the predecessor of node to. If to is the first node on the path of 
flow i, we set by convention pred^ ( to) = —1 and b~^ = bi. 
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Now put all the 6™, for all {i, m) such that m G i, into a vector x with one 
column and n rows, for some appropriate n. We can re-write Equation (2.22) as 

X < Ax + a (2.23) 



where Ais ann x n, non-negative matrix and a is a non-negative vector depending 
only on the known quantities 6,. The method now consists in assuming that the 
spectral radius of matrix A is less than 1. In that case the power series I + A + A“^ + 
A^ + ... converges and is equal to (7 — A) “ ^ , where I is the n x n identity matrix. 
Since A is non-negative, (7 — A)~^ is also non-negative; we can thus multiply 
Equation (2.22) to the left by (7 — A) and obtain: 

x<{I-A)-^a (2.24) 



which is the required result, since x describes the burstiness of all flows at all nodes. 
Erom there we can obtain bounds on delays and backlogs. 

Let us apply this step to our network example. By symmetry, we have only two 
unknowns x and y, deflned as the burstiness after one and two hops: 

f X = bQ = b\ = b2 = bn = = bl 

{ y = bl = bj=b^2=bl=bl = bt 
Equation (2.22) becomes 

f X < (7 + + 2x + 2y) 

\ y <x + + x + 2y) 

Deflne rj = from Equation (2.21) we conclude that 0 < rj < 1. We can now 

write Equation (2.23) with 



X = 



X 

y 



( 2rj 2rj 

1 -I- r? 2f] 



a = 



a(l + r]) \ 
2ar] ) 



Some remnant from linear algebra, or a symbolic computation software, tells us that 



(7-A)-i 



l-2»i 

1 — 6rj-|-2rj^ 

1+t] 

1 — 6rj-|-2rj^ 



2»i \ 

l-6?)-|-2»)2 1 

l-2»i 

l-6?)-|-2»)2 / 



If r? < i(3 - v/7) « 0.177 then (7 - A)-^ is positive. This is the condition for the 
spectral radius of A to be less than 1 . The corresponding condition on the utilization 
factor 1 / = is 

8 - v/7 

1/ < 2 « 0.564 (2.25) 

Thus, for this speciflc example, if Equation (2.25) holds, and if the burstiness terms 
X and y are flnite, then they are bounded as given in Equation (2.24), with {I — A) 
and a given above. 
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Second Step: time stopping We now prove that there is a finite bound if the 
spectral radius of A is less than 1. For any time r > 0, consider the virtual system 
made of the original network, where all sources are stopped at time r. For this 
network the total number of bits in finite, thus we can apply the conclusion of step 
1 , and the burstiness terms are bounded by Equation (2.24). Since the right-handside 
Equation (2.24) is independent of r, letting r tend to +oo shows the following. 

Proposition 2.4.1. With the notation in this section, if the spectral radius of A is 
less than 1, then the burstiness terms 6™ are bounded by the corresponding terms 
in Equation (2.24). 



Back to the example of Eigure 2.7, we find that if the utilization factor v is less 
than 0.564, then the burstiness terms x and y are bounded by 



X < 2a 

y < 2cr 



18-33i^+16i^^ 

36-96!^+57i^2 

18 - 18 i/+i^^ 



36-96!^+57i^2 



The aggregate traffic at any of the three nodes is 76 p,(>-smooth with b = 2{a+x+y). 
Thus a bound on delay is (see also Figure 2.8): 

b ^ cr 108 - 1987/ + 91 z/2 
~C~ 36 - 967/ + 577/2 




Figure 2.8: The bound d on delay at any node obtained by the method pre- 
sented here for the network of Figure 2.7 (thin line). The graph shows d 
normaiized by ^ (namely, ^), plotted as a function of the utilization factor. 
The thick line is a deiay bound obtained if every flow is re-shaped at every 
output. 



What happens when the hound is infinite ? Call t/q the upper bound on uti- 
lization factors for which the method in Section 2.4.2 finds a finite bound. In the 
example, we found t/q « 0.564, which is much less than 1. Does it mean that no 
finite bound exists when t/q < 1 ? The answer to this question is not clear. 
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First, the i/q found with the method can be improved if we express more arrival 
constraints. Consider our particular example: we have not exploited the fact that the 
fraction of input traffic to node i that originates from another node has to be A c- 
smooth. If we do so, we will obtain better bounds. Second, if we know that nodes 
have additional properties, such as FIFO, then we may be able to find better bounds. 

In general, the issue of stability, or existence of finite bounds, for a given network 
with aggregate multiplexing is still unresolved. The question of delay bounds for a 
network with aggregate scheduling was first raised in [7]. We will see in Chapter 6 
that if the network is a ring of constant bit rate servers, then i/q = 1, in other words, 
we can find a bound for any i/ < 1 [69]. However, the proof does not extend to gen- 
eral service curve guarantees, even strict service curves. In [3], the author exhibits 
an unstable network similar to ours, with i/ = 1 (a “critical” network), and claims 
that this also holds for some sub-critical networks, but the proof is not conclusive. 
In addition, a startling fact referred to as a “non-monotone property” of FIFO net- 
works is described in [3], where it is shown that a network that is stable with a set of 
sessions with given rates may become unstable if the rates of some of these sessions 
are reduced for a period of time. On the other side of the spectrum, we will also see 
in Chapter 6 strong and explicit results [46, 13] with FIFO multiplexing. 

The price for aggregate scheduling Consider again the example on Figure 2.7, 
but assume now that every flow is reshaped at every output. This is not possible 
with differentiated services, since there is no per-flow information at nodes other 
than access nodes. However, we use this scenario as a benchmark that illustrates the 
price we pay for aggregate scheduling. 

With this assumption, every flow has the same arrival curve at every node. Thus 
we can compute a service curve j3\ for flow 1 (and thus for any flow) at every node, 
using Theorem 2.4.1; we find that /3i is the rate-latency function with rate (C — 5p) 
and latency Thus a delay bound for flow at any node, including the re-shaper, 

is h{ai,ai ® (3i) = h{ai,(3i) = for p < ^. Figure 2.8 shows this delay 

bound, compared to the delay bound we found if no reshaper is used. As we already 
know, we see that with per-flow information, we are able to guarantee a delay bound 
for any utilization factor < 1. However, note also that for relatively small utilization 
factors, the bounds are very close. 

2.4.3 An Explicit Delay Bound for Differentiated Services Net- 
works 

We consider a low delay traffic class, as mentioned in Section 2.4.1, and find a 
closed form expression for the worst case delay, which is valid in any topology, in a 
lossless network. This bound is based on the method in the previous section. 

Assumption and Notation We assume the following. Unlike with reservation ser- 
vices, we cannot assume here that we have detailed information about the path fol- 
lowed by every flow. In contrast, we have only aggregate parameters. 
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• All fresh arrival curves a, have the form of single leaky bucket constraints: 

— Tpi 

• The service curve provided to the aggregate of all low delay traffic at node 
TO is a rate-latency function with rate and latency Cm- The service curve 
guarantee need not be a strict service curve. Within an aggregate, packets are 
served in a FIFO manner. Let i? be a bound on all e m ■ 

• The utilization factor for node to is defined as 



— 

^ m 

We assume that !/„<!/ for all to. i/ is the network-wide bound on utilization 
that is admissible for low delay traffic. 

• For any node to let ^ ^ p. be a bound on all r^. 

r is a time constant that, roughly speaking, gives the maximum packet delay 
variation for any micro-flow. 

• The route of any flow in the network traverses at most h nodes (also referred 
to as hops). 

• Let Cm denote a bound on the peak rate of all incoming low delay traffic 
traffic at node to. If we have no information about this peak rate, then Cm = 
-boo. For a router with large internal speed and buffering only at the output. 
Cm is the sum of the bit rates of all incoming links. The delay bound is better 
for a smaller Cm ■ 

• Let Um = ■ Note that 0 < Um < 1, Um increases with Cm, and if 

Cm = + 00 , then Um = 1- Call u = max^ Um- The parameter u expresses 
how much we gain by knowing the maximum incoming rates Cm- 

Theorem 2.4.2 (Closed form bound for delay and backlog [12]). Ifi/<i/Q = 
min/ ^ bound on delay variation for low delay traffic is liD \ 

with 

E + UTV 

^ 1 — (h — l)w/ 

At node to, the buffer required for serving low delay traffic without loss is bounded 
by 



Breq = Tm^ [E + T + Di(h - l)]~rm {I'iE + t) - Di[l - (h - < VmDi 
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Proof We use the method in the previous section and show separately that (1) if a 
finite bound for delay exists, then the formula in the theorem is true, ( 2 ) that a finite 
bound exists then (3) we show the formula for backlog. 

(Part 1) We assume that a finite bound exists. Call D' the worst case delay across 
any node. Consider a buffer at some node m. An arrival curve for the aggregate 
traffic at the input of is 



a(t) = min j tCm, cti(t + (h — I)D') 

\ i3m 

The former part in the formula is because the total incoming bit rate is limited by 
Cm ; the latter is because any flow reaching that node has undergone a delay bounded 
by (/i - l)i2'.Thus 



a{t) < a(t) = min (tCmjtiyrm + b') 



with 

b' = vrm{r + {h- l)i 2 ') 

A bound on the delay at our node is given by the horizontal deviation between the 
arrival curve a'{t) and the service curve. After some algebra, this gives a bound on 
the delay as Cm + ^ 7 ^- Now D' is the worst case delay; pick to to be a node where 
the worst case delay D' is attained. We must thus have 



from which we derive 



D' < Cm + 



D' < E + ui/{t + {h — l)wD') 



or equivalently 

(1 - (/i - l)w) D' <E + UTV (2.26) 

The condition on v means that v < Equation (2.26) implies then that 



D' < Di 



E + UTV 
1 — {h — l)uv 



The end-to-end delay is thus bounded by hD 1 , which is the required formula. 

(Part 2) We now prove that a hnite bound exists, using the time-stopping method. 
For any time f > 0, consider the virtual system made of the original network, where 
all sources are stopped at time t. This network satisfies the assumptions of part 1, 
since there is only a finite number of bits at the input. Call D ' (t) the worst case delay 
across all nodes for the virtual network indexed by t. From the above derivation we 
see that D'{t) < Di for all t. Letting t tend to -boo shows that the worst case delay 
at any node remains bounded by 1 . 




2.4. APPLICATION TO DIFFERENTIATED SERVICES 



113 



(Part 3) An arrival curve for the total incoming traffic at node m is min[C'mf, cti (t)], 
with 

ai{t) = lyrmlt + t + {h - l)Hi] 

A service curve is PE,rm- Thus a bound on backlog is obtained as Breq = 
supj{min[C'mf, Q!i(f)] — Define 6 as fhe solution to ai{9) = Cmd- 

We have 

Breq = (max(S, 6)) - l3{max{E, 6)) = m {{6 - E)+ + E] - r^(9 - E)+ 
Now 

0 = ^ ^ ^ (1 - u)[t + {h- l)Di] 

After some algebra, this gives the desired result. □ 

Discussion: If we have no information about the peak incoming rate C i, then we 

set Cl = +00 and the theorem tells us that, for v < a bound on delay is 

{E + tu). 

For finite values of Cm, the delay bound is smaller. Figure 2.9 illustrates the 
value of our bound on one example 




Figure 2.9: The bound D (in seconds) in Theorem 2.4.2 versus the utilization 
factor 1 / for /i = 10, E = 2 , o-j = lOOB and pi = 32kb/s for all flows, 

Tm = 149.760Mb/s, and Cm = +oo (thin line) or Cm = ‘2rm (thick line). 

Our bound explodes when i/ tends to i/q = min/ . which is 

similar to what we can see in Section 2.4.2. The same discussion applies here, that 
is to say, we do not know whether instability may occur or not, in general. In practice 
however, we should not expect the bound in Theorem 2.4.2 to be tight for values of 
1 / close to z/Q. To see why, consider the example in Section 2.4.2. We can apply 
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Theorem 2.4.2 and find that i/q = 0.5. However, we know a finite bound that is 
valid for i/ < 0.564, so explosion does not occur at i/q. The value of Theorem 2.4.2 
is to give an explicit, closed form bound, which is valid for small utilization factors. 
In Chapter 6, we hnd better bounds, at the expense of more restrictions on the routes 
and the rates. Such restrictions do not ht with the differentiated services framework. 
Note also that, for feed-forward networks, we know that there are hnite bounds for 
1 / < 1. However we show now that the condition i/ < i/q is optimal, in some sense. 

Proposition 2.4.2. [4, 12] With the assumptions of Theorem 2.4.2, iff < then 
for any D' > 0, there is a network in which the worst case delay is at least D' . 

In other words, the worst case queuing delay can be made arbitrarily large; thus 
if we want to go beyond Theorem 2.4.2, any bound for differentiated services must 
depend on the network topology or size, not only on the utilization factor and the 
number of hops. 

Proof: We build a family of networks, out of which, for any ', we can exhibit 

an example where the queuing delay is at least D' . 

The thinking behind the construction is as follows. All flows are low priority 
flows. We create a hierarchical network, where at the hrst level of the hierarchy we 
choose one “flow” for which its hrst packet happens to encounter just one packet of 
every other flow whose route it intersects, while its next packet does not encounter 
any queue at all. This causes the hrst two packets of the chosen How to come back- 
to-back after several hops. We then construct the second level of the hierarchy by 
taking a new How and making sure that its hrst packet encounters two back-to-back 
packets of each how whose routes it intersects, where the two back-to-back packet 
bursts of all these hows come from the output of a sufficient number of networks 
constructed as described at the hrst level of the hierarchy. Repeating this process 
recursively sufficient number of times, for any chosen delay value D we can create 
deep enough hierarchy so that the queuing delay of the hrst packet of some flow 
encounters a queuing delay more than D (because it encounters a large enough back- 
to-back burst of packets of every other flow constructed in the previous iteration), 
while the second packet does not suffer any queuing delay at all. We now describe 
in detail how to construct such a hierarchical network (which is really a family of 
networks) such that utilization factor of any link does not exceed a given factor v, 
and no flow traverses more than h hops. 

Now let us describe the networks in detail. We consider a family of networks 
with a single traffic class and constant rate links, all with same bit rate C . The 
network is assumed to be made of inflnitely fast switches, with one output buffer 
per link. Assume that sources are all leaky bucket constrained, but are served in an 
aggregate manner, hrst in hrst out. Leaky bucket constraints are implemented at the 
network entry; after that point, all flows are aggregated. Without loss of generality, 
we also assume that propagation delays can be set to 0; this is because we focus 
only on queuing delays. As a simpliflcation, in this network, we also assume that all 
packets have a unit size. We show that for any flxed, but arbitrary delay budget D, 
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we can build a network of that family where the worst case queueing delay is larger 
than D, while each flow traverses at most a specified number of hops. 

A network in our family is called JV{h, v, J) and has three parameters: h (max- 
imum hop count for any flow), i/ (utilization factor) and J (recursion depth). We 
focus on the cases where h > 3 and < i/ < 1, which implies that we can 
always find some integer k such that 



1 kh + 1 
h — 1 kh — 1 



(2.27) 



Network Af{h, v, J) is illustrated in Figures 2. 10 and 2. 1 1 ; it is a collection of iden- 
tical building blocks, arranged in a tree structure of depth J. Every building block 
has one internal source of traffic (called “transif traffic”), kh{h — l) inputs (called the 
“building block inputs”), kh{h — 1) data sinks, h — 1 internal nodes, and one output. 
Each of the h — 1 internal nodes receives traffic from kh building block inputs plus it 
receives transit traffic from the previous internal node, with the exception of the first 
one which is fed by the internal source. After traversing one internal node, traffic 
from the building block inputs dies in a data sink. In contrast, transit traffic is fed to 
the next internal node, except for the last one which feeds the building block output 
(Eigure 2.10). Eigure 2. 1 1 illustrates that our network has the structure of a complete 



buffer 




multiplexer demultiplexer 



1 data 

source (/?-!) M inputs 




Figure 2.10: The internal node (top) and the building block (bottom) used in 
our network example. 

tree, with depth J. The building blocks are organized in levels j = 1, ..., J. Each of 
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the inputs of a level j building block (j > 2) is fed by the output of one level j — 1 
building block. The inputs of level 1 building blocks are data sources. The output of 
one j — 1 building block feeds exactly one level j building block input. At level J, 
there is exactly one building block, thus at level J — 1 there are kh{h — 1) building 
blocks, and at level 1 there are {kh{h — 1))'^“^ building blocks. All data sources 




Figure 2.11: The network made of building blocks from Figure 2.10 

have the same rate r = and burst tolerance 6=1 packet. In the rest of this 

section we take as a time unit the transmission time for one packet, so that C = 1. 
Thus any source may transmit one packet every 9 = M±i time units. Note that a 
source may refrain from sending packets, which is actually what causes the large 
delay jitter. The utilization factor on every link is i/, and every flow uses 1 or /i hops. 

Now consider the following scenario. Consider some arbitrary level 1 building 
block. At time to, assume that a packet fully arrives at each of the building block 
inputs of level 1, and at time to + 1, let a packet fully arrive from each data source 
inside every level 1 building block (this is the first transit packet). The first transit 
packet is delayed hy hk — 1 time units in the first internal node. Just one time unit 
before this packet leaves the first queue, let one packet fully arrive at each input of 
the second internal node. Our first transit packet will be delayed again hy hk — 1 
time units. If we repeat the scenario along all internal nodes inside the building 
block, we see that the first transit packet is delayed by {h — l){hk — 1) time units. 
Now from Equation (2.27), 9 < (h— l)(hk — 1), so it is possible for the data source 
to send a second transit packet at time {h — l)(hk — 1). Let all sources mentioned 
so far be idle, except for the emissions already described. The second transit packet 
will catch up to the first one, so the output of any level 1 building block is a burst 
of two back-to-back packets. We can choose to arbitrarily, so we have a mechanism 
for generating bursts of 2 packets. 

Now we can iterate the scenario and use the same construction at level 2. The 
level-2 data source sends exactly three packets, spaced by 9. Since the internal node 
receives hk bursts of two packets originating from level 1, a judicious choice of the 
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level 1 starting time lets the first level 2 transit packet find a queue of 2hk — 1 packets 
in the first internal node. With the same construction as in level 1, we end up with a 
total queuing delay of {h — l){2hk — 1) > 2(h — l){hk — 1) > 26 for that packet. 
Now this delay is more than 26, and the first three level-2 transit packets are delayed 
by the same set of non-transit packets; as a result, the second and third level-2 transit 
packets will eventually catch up to the first one and the output of a level 2 block is 
a burst of three packets. This procedure easily generalizes to all levels up to J. In 
particular, the first transit packet at level J has an end-to-end delay of at least .16. 
Since all sources become idle after some time, we can easily create a last level J 
transit packet that finds an empty network and thus a zero queuing delay. 

Thus there are two packets in network Af{h,iy, J), with one packet having a 
delay larger than J6, and the other packet has zero delay. This establishes that a 
bound on queuing delay, and thus on delay variation in network JV{h, v, J) has to 
be at least as large as J6. □ 

2.4.4 Bounds for Aggregate Scheduling with Dampers 

At the expense of some protocol complexity, the previous bounds can be improved 
without losing the feature of aggregate scheduling. It is even possible to avoid bound 
explosions at all, using the concepts of damper. Consider an EDF scheduler (for 
example a SCED scheduler) and assume that every packet sent on the outgoing link 
carries a field with the difference d between its deadline and its actual emission time, 
if it is positive, and 0 otherwise. A damper is a regulator in the next downstream node 
that picks for the packet an eligibility time that lies in the interval [a + d — A,a + d], 
where A is a constant of the damper, and a is the arrival time of the packet in the 
node where the damper resides. We call A the “damping tolerance”. The packet 
is then withheld until its eligibility time [72, 17], see Figure 2.12. In addition, we 
assume that the damper operates in a FIFO manner; this means that the sequence of 
eligibility times for consecutive packets is wide-sense increasing. 

Unlike the scheduler, the damper does not exist in isolation. It is associated with 
the next scheduler on the path of a packet. Its effect is to forbid scheduling the packet 
before the eligibility time chosen for the packet. Consider Figure 2.12. Scheduler m 
works as follows. When it has an opportunity to send a packet, say at time t, it picks 
a packet with the earliest deadline, among all packets that are present in node N, 
and whose eligibility date is > t. The timing information d shown in the figure is 
carried in a packet header, either as a link layer header information, or as an IP hop 
by hop header extension. At the end of a path, we assume that there is no damper at 
the destination node. 

The following proposition is obvious, but important, and is given without proof. 

Proposition 2.4.3. Consider the combination S of a scheduler and its associated 
damper. If all packets are served by the scheduler before or at their deadlines, then 
S provides a bound on delay variation equal to A. 

It is possible to let A = 0, in which case the delay is constant for all packets. A 
bound on the end-to-end delay variation is then the delay bound at the last scheduler 
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Switching Fabric 

Scheduler k -j-q From Clamper n Scheduler m 




Departure from I Deadline at / 

^ L 



' \ 

r ] — - — I 1 Packet sent 

1 1 — - — 1 — ^ 1 from M to N 



Arrival at node N 



a a+ d -8 ^ 

Eligibility time picked by damper / 




Figure 2.12: Dampers in a differentiated services context. The model shown 
here assumes that routers are made of infinitely fast switching fabrics and 
output schedulers. There is one logical damper for each upstream scheduler. 
The damper decides when an arriving packet becomes visible In the node. 
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using the combination of a scheduler and a damper (this is called “jitter EDD” in 
[72]). In practice, we consider A > 0 for two reasons. Firstly, it is impractical to 
assume that we can write the field d with absolute accuracy. Secondly, having some 
slack in the delay variation objective provides better performance to low priority 
traffic [17]. 

There is no complicated feasibility condition for a damper, as there is for sched- 
ulers. The operation of a damper is always possible, as long as the there is enough 
buffer. 

Proposition 2.4.4 (Buffer requirement for a damper). If all packets are served 
by the scheduler before or at their deadlines, then the buffer requirement at the 
associated damper is bounded by the buffer requirement at the scheduler. 

Proof: Call R^f) the total input to the scheduler, and R'{f) the amount of data 

with deadline < t. Call R*(t) the input to the damper, we have R*{f) < R{f). 
Packets do not stay in the damper longer than until their deadline in the scheduler, 
thus the output i?i (t) of the damper satisfies Ri{t) > R' (t) . The buffer requirement 
at the scheduler at time t is R(t) — R'{f)', at the damper it is R*(t) — Ri{f) > 
R{t)-R'{t). □ 

Theorem 2.4.3 (Delay and backlog bounds with dampers). Take the same as- 
sumptions as in Section 2.4.3, plus we assume that every scheduler m that is not an 
exit point is associated with a damper in the next downstream node, with damping 
tolerance A^- Let A be a bound on all A m- 

Ifv< 1, then a bound on the end-to-end delay jitter for low delay traffic is 

D = E -\- (h — 1)A(1 -b uv) -b UTV 

A bound on the queuing delay at any scheduler is 

D 2 = E -\- w[t -b (/i — 1)A] 

The buffer required at scheduler m, for serving low delay traffic without loss is 
bounded by 

Breq = [E -\- T -\- A(h - 1)] - {u[E -b T -b A(h - 1)] - D 2 }~^ < rmT >2 

A bound on the buffer required at damper m is the same as the buffer required at 
scheduler m. 

Proof: The variable part of the delay between the input of a scheduler and the 

input of the next one is bounded by A. Now let us examine the last scheduler, say 
TO, on the path of a packet. The delay between a source for a flow i B m and 
scheduler to is a constant plus a variable part bounded by (/i — 1) A. Thus an arrival 
curve for the aggregate low-delay traffic arriving at scheduler to is min Cmt, Q !2 (t), 
with 
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a^it) = i/rm(t + r + {h - 1)A) 

By applying Theorem 1.4.2, a delay bound at scheduler m is given by 
D 2 = E + w[t + (h — 1)A] 

A bound on end-to-end delay variation is {h — 1)A -|- D 2 , which is the required 
formula. 

The derivation of the backlog bound is similar to that in Theorem 2.4.2. □ 

The benefit of dampers is obvious: there is no explosion to the bound, it is finite 
(and small if A is small) for any utilization factor up to 1 (see Figure 2.13). Further- 
more, the bound is dominated by hA, across the whole range of utilization factors 
up to 1. A key factor in obtaining little delay variation is to have a small damping 
tolerance S. 




Figure 2.13: The bound D (in seconds) in Theorem 2.4.3 the same param- 
eters as Figure 2.9, for a damping tolerance A = 5 ms per damper, and 
Cm = +00 (thick line). The figure also shows the two curves of Figure 2.9, 
for comparison. The bound is very close to hA = 0.05s, for all utilization 
factors up to 1. 



There is a relation between a damper and a maximum service curve. Consider 
the combination of a scheduler with minimum service curve (3 and its associate 
damper with damping tolerance A. Call p the fixed delay on the link between the 
two. It follows immediately that the combination offers the maximum service curve 
(3 (8) ()p-A and the minimum service curve (3 ^6p. Thus a damper may be viewed as 
a way to implement maximum service curve guarantees. This is explored in detail 
in [17]. 
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2.5 Exercises 

Exercise 2.1. Consider a guaranteed rate scheduler, with rate R and delay v, that 
receives a packet flow with cumulative packet length L. The (packetized) scheduler 
output is fed into a constant bit rate trunk with rate c > R and propagation delay 
T. 

1. Find a minimum service curve for the complete system. 

2. Assume the flow of packets is (r, b) -constrained, with b > /max- Find a bound 
on the end-to-end delay and delay variation. 

Exercise 2.2. Assume all nodes in a network are of the GR type. A flow with T- 
SPEC a{t) = min(ri + 6, M + pt) has performed a reservation with rate R across 
a sequence ofH nodes. Assume no reshaping is done. What is the buffer requirement 
at the hth node along the path, for h = 1, ...H ? 

Exercise 2.3. Assume all nodes in a network are made of a shaper followed by a 
GR scheduler. A flow with T-SPEC a{t) = min(rt + 6, M + pt) has performed a 
reservation with rate R across a sequence of H nodes. Assume that the shaper at 
every node uses the shaping curve a = ‘yr,b- What is the buffer requirement at the 
hth node along the path, for h = 1, ...H ? 

Exercise 2.4. Assume all nodes in a network are made of a shaper followed by a 
EIFO multiplexer. Assume that flow I has T-SPEC, aflf) = min(r,i + 6,, M -\-pf), 
that the shaper at every node uses the shaping curve a i = ‘yn,bi for flow i. Eind the 
schedulability conditions for every node. 

Exercise 2.5. A network consists of two nodes in tandem. There are n \ flows of type 
1 and ri 2 flows of type 2. Flows of type i have arrival curve aflf) = rf-\-bi, / = 1, 2. 
All flows go through nodes 1 then 2. Every node is made of a shaper followed by 
an EDE scheduler. At both nodes, the shaping curve for flows of type i is some a i 
and the delay budget for flows of type i is di. Every flow of type i should have a 
end-to-end delay bounded by D i. Our problem is to find good values ofd\ and d 2 . 

1. We assume that Oi = a,. What are the conditions ond\ and d 2 for the end-to- 
end delay bounds to be satisfied ? What is the set of{n\,n 2 ) that are schedu- 
lable ? 

2. Same question if we set o', = 

Exercise 2.6. Consider the scheduler in Theorem 2.3.3. Eind an efficient algorithm 
for computing the deadline of every packet. 

Exercise 2.7. Consider a SCED scheduler with target service curve for flow i given 
by 

Pi = ln,bi ® ^di 

Eind an efficient algorithm for computing the deadline of every packet. 

Flint: use an interpretation as a leaky bucket. 
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Exercise 2.8. Consider the example of Figure 2. 7. Apply the method of Section 2.4.2 
but express now that the fraction of input traffic to node i that originates from an- 
other node must have Xc as an arrival curve . What is the upper-bound on utilization 
factors for which a bound is obtained ? 

Exercise 2.9. Consider the delay bound in Theorem 2.4.2. Take the same assump- 
tions but assume also that the network is feedforward. Which better bound can you 
give ? 




Chapter 3 

Basic Min-plus and Max-plus 
Calculus 



In this chapter we introduce the basic results from Min-plus that are needed for the 
next chapters. Max-plus algebra is dual to Min-plus algebra, with similar concepts 
and results when minimum is replaced by maximum, and infimum by supremum. As 
basic results of network calculus use more min-plus algebra than max-plus algebra, 
we present here in detail the fundamentals of min-plus calculus. We briefly discuss 
the care that should be used when max and min operations are mixed at the end of 
the chapter. A detailed treatment of Min- and Max-plus algebra is provided in [24], 
here we focus on the basic results that are needed for the remaining of the book. 
Many of the results below can also be found in [10] for the discrete-time setting. 



3.1 Min-plus Calculus 

In conventional algebra, the two most common operations on elements of Z or K are 
their addition and their multiplication. In fact, the set of integers or reals endowed 
with these two operations verify a number of well known axioms that define alge- 
braic structures: (Z , -b, x ) is a commutative ring, whereas (K, +, x ) is a field. Here 
we consider another algebra, where the operations are changed as follows: addition 
becomes computation of the minimum, multiplication becomes addition. We will 
see that this defines another algebraic structure, but let us first recall the notion of 
minimum and infimum. 

3.1.1 Infimum and Minimum 

Let iS be a nonempty subset of K. S is bounded from below if there is a number M 
such that s > M for all s G S. The completeness axiom states that every nonempty 
subset iS of K that is bounded from below has a greatest lower bound. We will call 
it infimum of S, and denote it by inf S. For example the closed and open intervals 
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[a, b] and (a, b) have the same infimum, which is a. Now, if S contains an element 
that is smaller than all its other elements, this element is called minimum of S, 
and is denoted hy min S. Note that the minimum of a set does not always exist. For 
example, (a, b) has no minimum since a ^ (a,b). On the other hand, if the minimum 
of a set S exists, it is identical to its infimum. For example, min[a, b] = inf [a, b] = 
a. One easily shows that every finite nonempty subset of K has a minimum. Finally, 
let us mention that we will often use the notation A to denote infimum (or, when if 
exisfs, fhe minimum). For example, a Ab = min{a, 6}. If 5 is empty, we adopt the 
convention that inf iS = +oo. 

If / is a function from S to K, we denote hy f{S) its range: 

/(iS) = {t such that t = f{s) for some s € <S}. 

We will denote the infimum of this set by the two equivalent notations 

inf/(5) = inf{/(s)}. 
sES 

We will also often use the following property. 

Theorem 3.1.1 (“Fubini” formula for infimum). Let S be a nonempty subset of 
IR, and f be a function from S to M. Let be a collection of subsets of S, 

whose union is S. Then 

= inf I mf {/(s„)}| • 

sES n€N j 



Proof: By definition of an infimum, for any sets iS„, 



inf I [J I = inf {inf 5„) . 



On the other hands, since U„iS„ = S, 



so that 



in: 

ses 



/( U'5") = 

VnSN / nSN 



f{/(s)} = inf/(5)=inf/ |J5„ 



VnGl^ 



= inf < U f " f 

^ mGN 

InGN ) 

= inf I inf {/(s)}| • 

kGN \sG<S„ ^ ^ j 



□ 
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3.1.2 Dioid (M U {+oo}, A, +) 

In traditional algebra, one is used to working with the algebraic structure (K, +, x), 
that is, with the set of reals endowed with the two usual operations of addition and 
multiplication. These two operations possess a number of properties (associativity, 
commutativity, distributivity, etc) that make (K, +, x ) a commutative field. As men- 
tioned above, in min-plus algebra, the operation of ‘addition’ becomes computation 
of the infimum (or of the minimum if it exists), whereas the one of ‘multiplication’ 
becomes the classical operation of addition. We will also include -boo in the set of 
elements on which min-operations are carried out, so that the structure of interest is 
now (K U {-boo}, A, -b). Most axioms (but not all, as we will see later) defining a 
field still apply to this structure. For example, distribution of addition with respect 
to multiplication in conventional (‘Plus-times’) algebra 

(3 -b 4) X 5 = (3 X 5) -b (4 X 5) = 15 -b 20 = 35 
translates in min-plus algebra as 

(3 A 4) -b 5 = (3 -b 5) A (4 -b 5) = 8 A 9 = 8. 

In fact, one easily verifies that A and -b satisfy the following properties: 

• (Closure of A) For all a, 6 € IR U j-boo), a A 6 € IR U j-boo). 

• (Associativity of A) For all a, 6, c € IR U j-boo), {a A b) A c = a A {b A c). 

• (Existence of a zero element for A) There is some e = -boo € IR U j+oo) 
such that for all a € IR U j-boo), a At = a. 

• (Idempotency of A) For all a € IR U {-boo}, a Aa = a. 

• (Commutativity of A) For all a, 6 € IR U j-boo), a Ab = b A a. 

• (Closure of -b) For all a, 6 € IR U j-boo), a -b 6 G IR U j-boo). 

• (Associativity of -b) For all a,b,c & IR U j-boo), (a -b 6) -b c = a -b (6 -b c). 

• (The zero element for A is absorbing for -b) For all a G IRU {+ oo), a-be = 
e = e -b a. 

• (Existence of a neutral element for -b) There is some u = 0 G IR U j+oo) 
such that for all a G IR U j-boo), a-bu = a = u-ba. 

• (Distributivity of -b with respect to A) For all a, 6, c G IRU j-boo), (a A 6) -b 
c = (a -b c) A (6 -b c) = c -b (a A 6). 

A set endowed with operations satisfying all the above axioms is called a dioid. 
Moreover as -b is also commutative (for all a, 6 G IR U j-boo), a -b 6 = 6 -b a), the 
structure (KU j-boo), A, -b) is a commutative dioid. All the axioms defining a dioid 
are therefore the same axioms as the ones defining a ring, except one: the axiom of 
idempotency of the ‘addition’, which in dioids replaces the axiom of cancellation of 
‘addition’ in rings (i.e. the existence of an element (—a) that ‘added’ to a gives the 
zero element). We will encounter other dioids later on in this chapter. 
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3.1.3 A Catalog of Wide-sense Increasing Functions 

A function / is wide-sense increasing if and only if /(s) < f{t) for all s < t. 
We will denote by Q the set of non-negative wide-sense increasing sequences or 
functions and by T denote the set of wide-sense increasing sequences or functions 
such that f(L) = 0 for f < 0. Parameter t can be continuous or discrete: in the latter 
case, / = e Z} is called a sequence rather than a function. In the former 

case, we take the convention that the function / = {/(f), f € IR} is left-continuous. 
The range of functions or sequences of T and Q is K+ = [0, -boo]. 

Notation / -b (respectively f !\g) denotes the point-wise sum (resp. minimum) 
of functions / and g\ 



(/ + fl')W = /W + fl'W 

(/Afif)(f) = f{t)Ag{t) 

Notation / < (=, >)g means that f{t) < (=, >)g{t) for all t. 

Some examples of functions belonging to T and of particular interest are the 
following ones. Notation [a;]"'' denotes maxja;, 0), [a;] denotes the smallest integer 
larger than or equal to x. 

Definition 3.1.1 (Peak rate functions Xr). 



XR{t) — 



Rt ift > 0 
0 otherwise 



for some R > 0 ( the ‘rate ’). 

Definition 3.1.2 (Burst delay functions St)- 

-boo ift > T 



Srit) = 



0 



otherwise 



for some T > 0 ( the ‘delay ’). 



Definition 3.1.3 (Rate-latency functions I3r,t)- 



l3R,T(t) = R[t-T]+ = I 



R(t - T) ift > T 
0 otherwise 

for some R > 0 ( the ‘rate ’) and T > 0 ( the ‘delay 

Definition 3.1.4 (Affine functions ^r,b)- 

. , _ J rt -\-b ift > 0 

Tr,b\ J ■! Q otherwise 

for some r > 0 (the ‘rate’) and h > 0 (the ‘burst’). 




3.1. MIN-PLUS CALCULUS 



129 



Definition 3.1.5 (Step Function vt )- 

_ / 1 ift>T 

^T\) {t>T} I Q otherwise 

for some T > 0. 



Definition 3.1.6 (Staircase Functions ut , t )' 



UT,t 





ift > 0 
otherwise 



for some T > 0 (the ‘interval’) and 0 < t < T (the ‘tolerance’). 



These functions are also represented in Figure 3.1. By combining these basic 
functions, one obtains more general piecewise linear functions belonging to IF. For 
example, the two functions represented in Figure 3.2 are written using A and + from 



affine functions and rate-latency functions as follows, with r 1 > r 2 > . 


. . > ri and 


bi <b 2 < ... <bi 






h = 


lr,M ^ • --Irifii = min 

Ki<I 


(3.1) 


/2 = 


^R A {/3ij,2T + RT} A {/3ij,4T + 2i?T} A . . . 




= 


inf {(dRfiiT + iRT~\ . 
«>0 


(3.2) 



We will encounter other functions later in the book, and obtain other representations 
with the min-plus convolution operator. 



3.1.4 Pseudo-inverse of Wide-sense Increasing Functions 

It is well known that any strictly increasing function is left-invertible. That is, if for 
any t\ < t2, f{ti) < f{t2), then there is a function f~^ such that = 

t for all t. Flere we consider slightly more general functions, namely, wide-sense 
increasing functions, and we will see that a pseudo-inverse function can defined as 
follows. 

Definition 3.1.7 (Pseudo-inverse). Let f be a function or a sequence of T. The 
pseudo-inverse of f is the function 

= inf {t such that f(t) > x} . (3.3) 

For example, one can easily compute that the pseudo-inverses of the four func- 
tions of Definitions 3.1.1 to 3.1.4 are 

, 5-1 = So AT 

!^r]t = ll/R,T 

lr,b ~ l^llr,b- 

The pseudo-inverse enjoys the following properties: 
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Peak rate function Burst-delay function 




Staircase function Step function 




Figure 3.1: A catalog of functions of T: Peak rate function (top left), burst- 
delay function (top right), rate-latency function (center left), affine function 
(center right), staircase function (bottom left) and step function (bottom 
right). 
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Figure 3.2; Two piecewise linear functions of T as defined by (3.1) (left) and 
(3.2) (right). 

Theorem 3.1.2 (Properties of pseudo-inverse functions). Let f £ T, x,t > 0. 

• (Closure) f~^ € T and /“^(O) = 0. 

• (Pseudo-inversion) We have that 

f(t)>x f~^(x)<t (3.4) 

f~^(x)<t f{t)>x (3.5) 

• (Equivalent definition ) 

f~^(x) = sup {t such that f(t) < x} . (3.6) 

Proof: Define subset Sx = {t such that f(t) > a;} C K+ . Then (3.3) becomes 

f~^(x) = inf iSj;. (Closure) Clearly, from (3.3), f~^(x) = 0 for a; < 0 (and in 

particular /“^(O) = 0). Now, let 0 < a;i < X 2 . Then Sxi D Sx 2 , which implies 
that inf iSa;j < infiS^j^ and hence that (a;i) < f~^{x 2 ). Therefore/"^ is wide- 
sense increasing. (Pseudo-inversion) Suppose first that f(t) > x. Then t G Sx, and 

so is larger than the infimum of Sx , which is f~^ (x) : this proves (3.4). Suppose next 
that /-I (a;) < t. Then t > inf Sx, which implies that t € iS^;, by definition of an 
infimum. This in turn yields that f(t) > x and proves (3.5). (Equivalent definition) 

Define subset Sx = {t such that f(t) < a;} C K+ . Pick t & Sx and i & Sx. Then 
f(i) < f{t), and since / is wide-sense increasing, it implies that i <t. This is true 
for any t G Sx and i G Sx, hence sup Sx < inf Sx . As U = K+ , we cannot 
have sup Sx < inf Sx . Therefore 

SUPiSa; = inf^a; = f~^{x). 

□ 
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3.1.5 Concave, Convex and Star-shaped Functions 

As an important class of functions in min-plus calculus are the convex and concave 
functions, it is useful to recall some of their properties. 

Definition 3.1.8 (Convexity in K" ). Let u be any real such that 0 < u < 1. 

• Subset iS C K" is convex if and only ifux + (1 — u)y € S for all x,y G S. 

• Function f from a subset D C K" to IR is convex if and only if f{ux + (1 — 
u)y) < M/(a;) + (1 - u)f{y) for allx,y £ D. 

• Function f from a subset "D C K" to K is concave if and only if—f is convex. 

For example, the rate-latency function (Fig 3.1, center left) is convex, the piece- 
wise linear function /i given by (3.1) is concave and the piecewise linear function 
/2 given by (3.2) is neither convex nor concave. 

There are a number of properties that convex sets and functions enjoy [68]. 
Here are a few that will be used in this chapter, and that are a direct consequence of 
Definition 3.1.8. 

• The convex subsets of K are the intervals. 

• If iSi and S 2 are two convex subsets of K" , their sum 

iS = iSi -I- 1 S 2 = {s G IR" I s = Si -I- S 2 for some si G <Si and S 2 G 1 S 2 } 
is also convex. 

• Function / from an interval [a, h\ to K is convex (resp. concave) if and only if 
f{ux + (1 — u)y) < (resp. >) uf{x) + (1 — u)f{y) for all a;, y G [a, h] and 
allu G [0.1]. 

• The pointwise maximum (resp. minimum) of any number of convex (resp. 
concave) functions is a convex (resp. concave) function. 

• If iS is a convex subset of , n > 1, the function from K" to K defined by 

/(a;) = inf{/i G IR such that {x,y) G 5} 



is convex. 

• If / is a convex function from K" to K, the set S defined by 
S = {{x,if) G such that f{x) < jf] 



is convex. This set is called the epigraph of /. It implies in the particular case 
wheren = 1 that the line segment between {a, /(a)} and {6, /(6)} lies above 
the graph of the curve y = f{x). 
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The proof of these properties is given in [68] and can he easily deduced from Def- 
inition 3.1.8, or even from a simple drawing. Chang [10] introduced star-shaped 
functions, which are defined as follows. 

Definition 3.1.9 (Star-shaped fnnction). Function f G IF is star-shaped if and 
only iff(t) /t is wide-sense decreasing for all t > 0. 

Star-shaped enjoy the following property: 

Theorem 3.1.3 (Minimum of star-shaped functions). Let /, g be two star-shaped 
functions. Then h = f A g is also star-shaped. 

Proof: Consider some t > 0. If /i(t) = f(f), then for all s > t, h{t)lt = 

^ /(s)/s > h{s)js. The same argument holds of course if h{t) = g{t). 
Therefore h{t)jt > h{s)js for all s > t, which shows that h is star-shaped. □ 
We will see other properties of star-shaped functions in the next sections. Let us 
conclude this section with an important class of star-shaped functions. 

Theorem 3.1.4. Concave functions are star-shaped. 

Proof: Let / be a concave function. Then for any u € [0, 1] and x,y >0, f{ux -f 

(1 — u)y) > uf(x) + (1 — u)f(y). Take a; = t,y = 0 andu = s/t, with 0 < s < L 
Then the previous inequality becomes /(s) > {s/t)f{t), which shows that f{t)/t 
is a decreasing function of t. □ 

On the other hand, a star-shaped function is not necessarily a concave function. 
We will see one such example in Section 3.1.7. 



3.1.6 Min-plus Convolution 

Let f{t) be a real- valued function, which is zero for f < 0. If f € IR, the integral of 
this function in the conventional algebra (K, -h, x ) is 

f{s)ds 

which becomes, for a sequence f{t) where f € Z, 

tm- 

s^O 

In the min-plus algebra (K U {-foo}, A, -f), where the ‘addition’ is A and the ‘mul- 
tiplication’ is -f, an ‘integral’ of the function / becomes therefore 




inf 

such that 0<s<t 



ifis)}, 



which becomes, for a sequence f{t) where f € Z, 
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min 

sEZ such that 0<s<t 

We will often adopt a shorter notation for the two previous expressions, which is 

0<s<t 

with s e Z or s e IR depending on the domain of /. 

A key operation in conventional linear system theory is the convolution between 
two functions, which is defined as 

/ + 00 

f{t - s)g{s)ds 

-CO 

and becomes, when f(t) and g(t) are two functions that are zero for f < 0, 

if^g)(t)=[ fit - s)g(s)ds. 

Jo 

In min-plus calculus, the operation of convolution is the natural extension of the 
previous definition: 

Definition 3.1.10 (Min-plus convolution). Let f and g be two functions or se- 
quences of IF. The min-plus convolution of f and g is the function 



if ® g)it) = inf {fit - s) -f gis)} . ( 3 . 7 ) 

0<s<t 

ilft < 0, if <^g) it) = 0). 

Example. Consider the two functions 7^,;, and ( 3 r^t, with 0 < r < R, and let us 
compute their min-plus convolution. Let us first compute it for 0 <t <T. 

ilr,b® (iR,T)if) = mi {-ir,bit-s)-\-R[s-T\^] 

= inf {'yr,bit - s) -I- 0} = 7r,(>(0) -l-0 = 0-l-0 = 0 

Q<s<t 

Now, lit > T, one has 

ilr,b ^l 3 R,T)it) 

= inf [jr,biit - s) + R[s -T]~^} 

= inf {7^ (,(f - s) -I- i?[s - T] + | A inf {7^ ;,(f - s) -f i?[s - T] + | 

A inf {^r bit - s) i?[s - T]~^} 

s=t 

= inf {6 -I- r(f — s) -I- 0 } A inf {b rU — s) RU — T)} 

0<s<T T<s<t 

A{ 0 -|-i?(f-T)} 

= {6-|-r(f -T)} A -\- rt - RT -\- ^ini^^iiR - r)s}^ A {Rf - T)} 

= {b-\-rit-T)}A{b-\-rit-T)}A{Rit-T)} 

= {b-\-rit-T)}A{Rit-T)}. 
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The result is shown in Figure 3.3. Let us now derive some useful properties for the 




Figure 3.3; Function 7 ^,?, ® (3r,t when 0 < r < R. 



computation of min-plus convolution. 

Theorem 3.1.5 (General properties of ®). Let f,g,h£T. 

• Rule 1 (Closure of®) (f ® g) £ T. 

• Rule 2 (Associativity of ®) (f ® g) ® h = f ® (g ® h). 

• Rule 3 (The zero element for A is absorbing for ®) The zero element for A 
belonging to T is the function e, defined as e(f) = +00 for all t > 0 and 
e(f) = D for all t < 0. One has f ® s = s. 

• Rule 4 (Existence of a neutral element for ®) The neutral element is <5o, as 

f®So=f. 

• Rule 5 (Commutativity of®)f®g = g®f. 

• Rule 6 (Distributivity of® with respect to A) (f Ag)®h= {f®h)A(g® h). 

• Rule 7 (Addition of a constant) For any K £ K+, {f-£K)®g = (f®g) + K. 

The proof of these rules is easy. We prove the two first rules, the proof of the 
five others are left to the reader. 

Proof: (Rule 1) Since / is wide-sense increasing, 

f(ti - s) + g(s) < f(t 2 - s) + g{s) 
for all 0 < <t 2 and all s € IR. Therefore 
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inf {/(fi - s) + g{s)} < inf {/(i 2 - s) + g{s)} 
and as f{t) = g{t) = 0 when t < 0, this inequality is equivalent to 

„ inf {/(fi - s) + fl'(s)} < inf {/(i2 - s) + fl'(s)} , 

0<s<tl 0<S<t2 

which shows that (/ ® g){ti) < (/ ® g){t 2 ) for all 0 < < t 2 - (Rule 2) One has 






= inf 1 


inf {fit-s-u)-\-giu)}-\-h(s)\ 


0<s<t [ 


0<u<t-s ) 


= inf 1 


inf {fit -u') -\- g(u' - s) -\- h(s)}\ 


0<s<t [ 


s<u'<t J 


= inf ■ 


( + Hs)}] 


0<u' <t 


(0<s<«' J 


= inf ■ 


(/(f-u')+ inf - s) + /i(s)}] 


0<ii' <t 


( 0<s<«' J 


= ^ inf {fit - u') + ig® ti)(u')} 

0<u' <t 


= if ®ig®h))if). 



□ 

Rules 1 to 6 establish a structure of a commutative dioid for (JF, A, ®), whereas 
Rules 6 and 7 show that ® is a linear operation on (K+ , A, +). Now let us also 
complete these results by two additional rules that are helpful in the case of concave 
or convex functions. 

Theorem 3.1.6 (Properties of ® for concave/convex functions). Let f,g& T. 

• Rule 8 (Functions passing through the origin) If f{0) = g{0) = 0 then 
f ® g ^ f g- Moreover, if f and g are star-shaped, then f ® g = f /\ g- 

• Rule 9 (Convex functions) If f and g are convex then f ® g is convex. In 
particular if f, g are convex and piecewise linear, f '^g is obtained by putting 
end-to-end the different linear pieces of f and g, sorted by increasing slopes. 

Since concave functions are star-shaped, Rule 8 also implies that if /, g are con- 
cave with /(O) = (ji(0) = 0, then f ® g = f g. 

Proof: (Rule 8) As /(O) = ^/(O) = 0, 

{! ®g){t) = g(f) ^ mi {f(t - s) -\-g{s)} A f(t) < f(t) Ag(t). (3.8) 

0<s<t 

Suppose now that, in addition, / and g are star-shaped. Then for any t > 0 and 
0 < s <t f(t — s) > (1 — s/t)f(t) and g(s) > (s/t)g(t), so that 



fit - s) -h g(s) > fit) -i islt)ig(t) - fit)). 
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Now, as 0 < s/i < 1, f{t) + {s/t){g{t) — f{t)) > f{t) A g{t) so that 
fit - s) + 9{s) > fit) A git) 

for all 0 < s < f. Combining this inequality with (3.8), we obtain the desired result. 
(Rule 9) The proof uses properties of convex sets and functions listed in the previous 

subsection. The epigraphs of / and g are the sets 

> 5 i = {(si, /ri) G such that /(si) < /ri} 

^2 = {is2,P2) G such that gis2) < 1^2} 

Since / and g are convex, their epigraphs are also convex, and so is their sum S = 
Si + 1 S 2 , which can be expressed as 

S = G I for some {s,0 G [0,f] x [0,/r],/(f - s) < n - ^,gis) < 

As S is convex, function hit) = inf{/r G IR such that (f , /r) G 5} is also convex. 
Now h can be recast as 

hit) 

= inf{/r G IR | forsome(s,^) G [0,f] x [0,/r],/(f — s) < ji — ^,gis) < 

= inf {/r G IR I for some s G [0,t], fit — s) + gis) < /r} 

= - s) +fif(s),s G [0,f]} 

= if'^9)it), 

which proves that (/ ® g) is convex. 

If / and g are piecewise linear, one can construct the set 5 = iSi + 1 S 2 , which is 
the epigraph of / ® < 7 , by putting end-to-end the different linear pieces of / and g, 
sorted by increasing slopes [ 20 ]. 

Indeed, let h' denote the function that results from this operation, and let us 
show that h' = f ® g. Suppose that there are a total of n linear pieces from / 
and g, and label them from 1 to n according to their increasing slopes: 0 < ri < 
T 2 < ... < r„. Figure 3.4 shows an example for n = 5. Let T, denote the length 
of the projection of segment i onto the horizontal axis, for 1 < i < n. Then the 
length of the projection of segment i onto the vertical axis is r ,T,. Denote by S' the 
epigraph of h' , which is convex, and by dS' its boundary. Pick any point (f, h' it)) 
on this boundary dS' . We will show that it can always be obtained by adding a point 
it — s,fit — s)) of the boundary 3iSi ofNi andapoint (s,gis)) of the boundary 352 
of S 2 . Let k be the linear segment index to which {t, h'it)) belongs, and assume, 
with no loss of generality, that this segment is a piece of / (that is, k C 35i). We 
can express h'it) as 



k-l k-l 

h'it) = Tkit - '^Ti) -f 



(3.9) 
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Figure 3.4: Convex, piecewise linear functions / (and its epigraph <Si (top 
left)), g (and its epigraph S 2 (top right)), and f g (and its epigraph S = 
Si + iS 2 (bottom)). 




3.1. MIN-PLUS CALCULUS 



139 



Now, let s be the sum of the lengths of the horizontal projections of the segments 
belonging to g and whose index is less than k, that is, 

s = 5] T,. 

A: — 1 

Then we can compute that 

k-\ k-\ 

t-s = t~Y^Ti + Y,Ti- Y. 

i—1 i—1 — 1 

k-l 

= f - ^ T’j + 'Y Ti 

i—1 ,l<i<A: — 1 

and that 

k-l 

f{t-s) = rk{t-Y'^i)+ Y 

i—1 iC9<Si ,l<i<A: — 1 

9(s) = Y 

iC9<S2,l;<i:<A: — 1 

The addition of the right hand sides of these two equations is equal to h'{t), because 
of (3.9), and therefore f(t — s) + g{,s) = h'{t). This shows that any point of dS' 
can be broken down into the sum of a point of dS i and of a point of 882 , and hence 
that 88 ' = 3iSi + 882 , which in turn implies that 5' = iSi + 1 S 2 = 8 . Therefore 
h' = f^g. □ 

The last rule is easy to prove, and states that ® is isotone, namely: 

Theorem 3.1.7 (Isotonicity of ®). Let f,g,f',g' £ IF. 

• Rule 10 (Isotonicity) If f < g and /' < g' then f ® f < g ® g' . 

We will use the following theorem: 

Theorem 3.1.8. For f and g in T, if in addition g is continuous, then for any t there 
is some to such that 

if ^ 9)(t) = fi(to) + g(t - to) (3.10) 

where fi(to) = f(s) is the limit to the left of f at to. If f is left- 

continuous, then f I (to) = f(to)- 

Proof: Fix t. There is a sequence of times 0 < s„ 8 t such that 

inf (/(io) + - fo)) = lim (/(s„) + - s„)) (3.11) 

to<t n— >-oo 

Since 0 < s„ < f . we can extract a sub-sequence that converges towards some value 
to. We take a notation shortcut and write lim„_j.oo Sn = fo- If / is continuous, the 
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right hand-side in 3.11 is equal to //(io) + ~ to) which shows the proposition. 

Otherwise / has a discontinuity at to - Define S = /(fo) ~ fi{to)- We show that we 
can again extract a subsequence such that s„ < to - Indeed, if this would not be true, 
we would have s„ > fo for all but a finite number of indices n. Thus for n large 
enough we would have 

and by continuity of g: 

g{t - 

thus 

f{s„)-\-g{t - 

Now 

fi{to) + g{t 

thus 

f(sn) + g(t - Sn) > inf (f(s) -b g(t - s)) + - 

S<t Z 

which contradicts 3.11. Thus we can assume that s„ < to for n large enough and 
thus lim„^oo f{sn) = fi(to)- □ 

Finally, let us mention that it will sometimes be useful to break down a some- 
what complex function into the convolution of a number of simpler functions. For 
example, observe that the rate-latency function can be expressed as 

I3r,t = St Xr- (3.12) 

3.1.7 Sub-additive Functions 

Another class of functions will be important in network calculus are sub-additive 
functions, which are defined as follows. 

Definition 3.1.11 (Sub-additive function). Let f be a function or a sequence oftF- 
Then f is sub-additive if and only if f{t -b s) < f{t) -b f{s) for all s, f > 0. 

Note that this definition is equivalent to imposing that / < / ® /. If /(O) = 0, 
it is equivalent to imposing that / ® / = /. 

We will see in the following theorem that concave functions passing through the 
origin are sub-additive. So the piecewise linear function /i given by (3.1), being 
concave and passing through the origin, is sub-additive. 

The set of sub-additive functions is however larger than that of concave func- 
tions: the piecewise linear function /2 given by (3.2) is not concave, yet one check 
that it verifies Definition 3.1.11 and hence is sub-additive. 

Contrary to concave and convex functions, it is not always obvious, from a quick 
visual inspection of the graph of a function, to establish whether it is sub-additive or 
not. Consider the two functions (Sr^t + K' and (Sr^t + K" , represented respectively 



^Sn) > fl{to) + S 
-Sn) > g{t-to) - - 

- Sn) > fl(to) +g(t -to) + - 

- to) > inf (/(s) -b g{t - s)) 

S<t 
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on the left and right of Figure 3.5. Although they differ only by the constants K ' and 
K”, which are chosen so that 0 < K” < RT < K' < +oo, we will see + K' 
is sub-additive but not I3r^t + K" . Consider first K' . If ,s 1 < T, then 




Figure 3.5; Functions (3r^t + K' (left) and (3r^t + K" (right). The only differ- 
ence between them is the value of the constant: K" < RT < K'. 



s,t <T and 

^r,t{s + t) + K' = K' < 2K' = {I3r,t{s) + K') + (/ 3 fl . T ( i ) + K'). 
On the other hand, if s -|- f > T, then, since K' > RT, 



I^R,T(t -F s) -F 7F' = 

< 



< 



i?(f -F s - T) -F K' 
R{s-ft-T)-fK' -f [K' - RT) 
{R{t - T) -F K') -F {R{s - T) -F K') 
{(^R,T(t) -F K') -F {I3r,t{s) + K'), 



which proves that (3r^t + K' is sub-additive. Consider next (iR^r + K" . Pick s = T 
and t > T. Then, since K" < RT, 



I^R,T(t -F s) -F K" = 

!3R,T{t -F T) -F K” = Rt-f K” = R{t - T) -F i?T -F K” 

> R{t - T) + K” + K” = {flR,T{t) + K") + {(3r,t{s) + K"), 

which proves that (iR^r + K" is not sub-additive. 

Let us list now some properties of sub-additive functions. 

Theorem 3.1.9 (Properties of sub-additive functions). Let f,g £ T. 

• (Star-shaped functions passing through the origin) If f is star- shaped with 
/(O) = 0, then f is sub-additive. 

• (Sum of sub-additive functions) If f and g are sub-additive, so is (/ -F g). 
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• (Min-plus convolution of sub-additive functions) Iff and g are sub-additive, 
so is (/ ® fif). 

The first property also implies that concave functions passing through the origin 
are sub-additive. The proof of the second property is simple and left to the reader, 
we prove the two others. 



Proof: (Star-shaped functions passing through the origin) Let s,t > 0 be given. 

If s or f = 0, one clearly has that /(s -ft) = /(s) -f f{t). Assume next that s,t > 0. 
As / is star-shaped. 



/(s) > 

s -\-t 

fit) > -^fis+t) 

s -\-t 

which sum up to give /(s) +/(i) > f(s-i-t). (Min-plus convolution of sub-additive 
functions) Let s, f > 0 be given. Then 

if <^9){s) + if <»g) it) 

= ifi^-^)+siu)}+ ini {fit- v)-\-g{v)} 

0<u<s 0<v<t 

= inf inf {/(s -u)-\- f{t - t;) -b g{u) + g(v)} 

0<u<s 0<v<t 

- + t - iu -\- v)) -\- g(u -b t;)} 

0<U<S 0<V<t 

= ^Afis+t-iu3-v))3-g(u-i-v)} 

= if<^g)it + s). 



□ 

The minimum of any number of star-shaped (resp. concave) functions is still 
a star-shaped (resp. concave) function. If one of them passes through the origin, it 
is therefore a sub-additive function: for example, as already mentioned earlier, the 
concave piecewise linear function /i given by (3.1) is sub-additive. On the other 
hand the minimum of two sub-additive functions is not, in general, sub-additive. 
Take for example the minimum between a rate latency function (3 w and function 
/2 given by (3.2), when R' = 2i?/3. with R, T as defined in (3.2). Both functions 
are sub-additive, but one can check that fi is not. 

The first property of the previous theorem tells us that all star-shaped functions 
are sub-additive. One can check for example that (3r^t+K' is a star-shaped function 
(which is not concave), but not (3r^t + K" . One can also wonder if, conversely, all 
sub-additive functions are star-shaped. The answer is no: take again function /2 
given by (3.2), which is sub-additive. It is not star-shaped, because /(2T)/2T = 
i?/2 < 2i?/3 = /(3T)/3T. 
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3.1.8 Sub-additive Closure 

Given a function f G IF, if /(O) = 0, then / > / ® / > 0. By repeating this oper- 
ation, we will get a sequence of functions that are each time smaller and converges 
to some limiting function that, as we will see, is the largest sub-additive function 
smaller than / and zero in f = 0, and is called sub-additive closure of /. The formal 
definition is as follows. 

Definition 3.1.12 (Sub-additive closure). Let f be a function or a sequence of IF. 
Denote the function obtained by repeating {n — 1) convolutions of f with itself. 

By convention, f^^^ = <5o, so that f^^^ = f, = / ®/. ^tc- Then the sub-additive 
closure of f, denoted by /, is defined by 

7 = <^oA/A(/®/)A(/®/®/)A...= mf {/(”)}. (3.13) 

Example. Let us compute the sub-additive closure of the two functions (i r,t + K' 
and + K" , represented respectively on the left and right of Figure 3.5. Note 
first that Rule 7 of Theorem 3.1.5 and Rule 9 of Theorem 3.1.6 yield that for any 
IT > 0, 

{Pr,t + K) ig) (Pr^t + K) = (Pr^t ® I3r,t) + 27T = I3r,2T + 27T. 
Repeating this convolution n times yields that for all integers n > 1 
(Pr,t + + nK. 

Now, if K = K' > RT and t < riT, 

(3R,nT + nK' = nK' > (n - l)i?T -f K' = R{nT -T)-fK' 

> R[t-T]+ + K' =fiR^T + K', 

whereas if f > nT 

l3R,nT+nK' = R{t - nT) -f nK' = R{t - T) -f {n - 1){K' - RT) -f K' 
> R{t -T)-fK' = (iR^T + K' 

so that (Pr^t + K'Y"^ > (3r^t + K' for all n > 1. Therefore (3.13) becomes 
I3r,t -i- K' = So a inf | (Pr^t + K'Y'^^ | = <5o A (Pr,t + K'), 

n>l I J 

and is shown on the left of Figure 3.6. On the other hand, if K = K" < RT, the 
infimum in the previous equation is not reached in n = 1 for every f > 0, so that 
the sub-additive closure is now expressed by 

I3r,t + K" = SoA inf | {(Ir^t + K"Y'^^ | = <5o A inf {{I3r,„t + nK")} , 

n>l I J n>l 

and is shown on the right of Figure 3.6. 

Among all the sub-additive functions that are smaller than / and that are zero in 
t = 0, there is one that is an upper bound for all others; it is equal to / , as established 
by the following theorem. 
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Figure 3.6; The sub-additive closure of functions (3r^t + K' (left) and (3r^t + 
K" (right), when K" < RT < K' . 



Theorem 3.1.10 (Sub-additive closure). Let f be a function or a sequence of T, 
and let f be its sub-additive closure. Then (i) f < f, f & T and f is sub-additive, 
(ii) if function g £ IF is sub-additive, with gi(0) = 0 and g < f, then g < f. 

Proof: (i) It is obvious from Definition 3.1.12, that / < /. By repeating (n — 1) 

times Rule 1 of Theorem 3.1.5, one has that /(") £ IF for all n > 1. As = 
So & IF too, / = inf„>o{/^"^} G IF. Let us show next that / is sub-additive. For 
any integers n,m >0, and for any s,t > 0, 

0<u<t-\-s 



so that 



/(f-hs) = inf {/(”+™)(f-f s)} = inf + 

n+m>0 n,m>0 

< inf |/(")(f)+/(™)(s)} 
n,m>0 

= mf l/(”)(f)} + inf {/(™)(s)} = 7(f) +7(s) 

n>0 m>0 

which shows that / is sub-additive, (ii) Next, suppose that g G IF is sub-additive, 
g{0) = 0 and g < f. Suppose that for some n > 1, > g. Clearly, this holds 

for n = 0 (because (ji(0) = 0 implies that g < So = and for n = 1. Now, this 

assumption and the sub-additivity of g yield that for any 0 < s < f , / (f — s) -I- 
/(s) > g{t — s) -b g{s) > g{f) and hence that > g{f). By recursion on 

tt, > g for all n > 0, and therefore / = inf„>o{/^"^} > g. □ 

Corollary 3.1.1 (Sub-additive closure of a sub-additive function). Let f £ IF. 

Then the three following statements are equivalent: (i) f{0) = 0 and f is sub- 
additive (ii) f ^ f = f (Hi) f = f. 
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Proof: (i) => (ii) follows immediately from from Definition 3.1.11. (ii) => (iii): 

first note that f ® f = f implies that /(") = / for all n > 1. Second, note 
that (/ ® /)(0) = /(O) + /(O), which implies that /(O) = 0. Therefore / = 
inf„>o{/^"^} = So /\ .f = f . (iii) => (i) follows from Theorem 3.1.10. □ 

The following theorem establishes some additional useful properties of the sub- 
additive closure of a function. 

Theorem 3.1.11 (Other properties of suh-additive closure). Let f,g £ T 

• (Isotonicity) If f < g then f <g. 



• (Sub-additive closure of a minimum) f /\ g = f ® g. 

• (Sub-additive closure of a convolution) f ® g >/®5-///(0) = <l(0) =0 
then f ® g = f ® g. 



Proof: (Isotonocity) Suppose that we have shown that for some n > 1, / (") > 

gin-) (Clearly, this holds forn = 0 and for n = 1). Then applying Theorem 3. 1 .7 we 
get 

j(n-tl) ^ j(n) ^ J > gin) ^ ^ ^ g(n+l) ^ 

which implies by recursion on n that f < g. (Sub-additive closure of a minimum) 
One easily shows, using Theorem 3.1.5, that 

(/ A g)^‘^'> = (/ ® /) A (/ ® 5f) A (fif ® g). 



Suppose that we have shown that for some n > 0, the expansion of (/ A g) (") is 
(/A = 

/(”) A (/(”-!) ^g)A ® gin) ^ 

inf {/(”“*) I . 

0<k<n I'' J 



Then 

(/A<7)(”+i) 



(/ A fif) g 


3 (/A 




/®(/ A (?)(”)} 


A (/A^/)i”i} 


inf |j 


r(n+l-k) ^ 


giU 


|a inf (/(”- 


-k) ^^(*+l)| 


0<k<n 1 




J 0<k<n 1 


inf {j 


r("+l-*) 


giU) 


|a inf if 


■in-hl-k') 


0<k<n 1 




J l<k'<n-t-l 1 


inf 

0<k<n+l 


1 jin-Vl-k) 




>)| 





which establishes the recursion for all n > 0. Therefore 
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f 



inf 


inf 




■*) ® 


= inf inf {/(”-*) 


n >0 


VI 

VI 

0 


V 


k>0n>k 1 


inf 


inf |i 


'(') ® g 


(*) 1 = inf 


|inf{/«}®(7«| 


*>0 


/>o 1 


J *>0 


[/>0 J 


inf 

*>0 


{ 7 ®; 


,i‘i) . 


= /® inf{<7«} = /®5. 
k>0 



(Sub-additive closure of a convolution) Using the same recurrence argument as 
above, one easily shows that (/ ® ® g^'^K and hence that 

7®^ = inf I (/® <?)(”) I = mf 1 7”)® 7”^ I 

n>0 I J n>0 I J 

> inf I /(”) ® I 

n,m>0 I J 

If/(0) = g{0) = 0, Rule 8 in Theorem 3.1.6 yields that /®<jf < / A and therefore 
that f ^ g < f A g. Now we have just shown above that f A g = f ^g, so that 

f <^g <7 <^g. 



Combining this result with (3.14), we get f ® g = f ® g- □ 

Let us conclude this section with an example illustrating the effect that a differ- 
ence in taking t continuous or discrete may have. This example is the computation 
of the sub-additive closure of 




if f > 0 

if f < 0 



Suppose first that f € IR. Then we compute that 



(/ ® f){t) = inf {(f - s)^ -b s^} = (f/2)2 -I- (t!2f = f 12 



as the inhmum is reached in s = f/2. By repeating this operation n times, we obtain 
inf {(f — s)^ -b s^/(n — 1)1 = f^/n 

0<s<i '' •' 



as the inhmum is reached in s = f(l — 1/n). Therefore 

f{t) = inf {f^/n} = lim /n = 0. 



Consequently, if f € IR, the suh-additive closure of function / is 



/ = 0 , 
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Figure 3.7: The sub-additive closure of f{t) = when f e K (left) and 

when f e Z (right). 



as shown on the left of Figure 3.7. 

Now, if f e Z, the sequence f{t) is convex and piecewise linear, as we can 
always connect the different successive points {t, f^) for alH = 0, 1, 2, 3, . . .: the 
resulting graph appears as a succession of segments of slopes equal to (2t 1) (the 

hrst segment in particular has slope 1), and of projections on the horizontal axis 
having a length equal to 1, as shown on the right of Figure 3.7. Therefore we can 
apply Rule 9 of Theorem 3.1.6, which yields that / ® / is obtained by doubling 
the length of the different linear segments of /, and putting them end-to-end by 
increasing slopes. The analytical expression of the resulting sequence is 

(/®/)(f) = min {(f-s)2 +g2} = 

U<5<t 

Sequence = f ® f again convex and piecewise linear. Note the hrst segment 
has slope 1, but has now a double length. If we repeat n times this convolution, it 
will result in a convex, piecewise linear sequence / (t) whose hrst segment has 

slope 1 and horizontal length n: 

/i”i(f)=f if0<f<n, 

as shown on the right of Figure 3.7. Consequently, the sub-additive closure of se- 
quence / is obtained by letting n ^ oo, and is therefore f{t) = t for t > 0. 
Therefore, if f € Z , 

7 = Ai. 

3.1.9 Min-plus Deconvolution 

The dual operation (in a sense that will clarihed later on) of the min-plus convolution 
is the min-plus deconvolution. Similar considerations as the ones of Subsection 3.1.1 
can be made on the difference between a sup and a max. Notation V stands for sup 
or, if it exists, for max: aV b = max{a, b}. 
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Definition 3.1.13 (Min-plus deconvolution). Let f and g be two functions or se- 
quences of IF. The min-plus deconvolution of f by g is the function 

if 0 g)(t) = sup {fit -\-u) - g(u)} . (3.15) 

u>0 

If both f{t) and g{t) are infinite for some t, then Equation (3.15) is not defined. 
Contrary to min-plus convolution, function (/ 0 g)it) is not necessarily zero for 
t < 0, and hence this operation is not closed in IF, as shown by the following 
example. 

Example. Consider again the two functions yr,b and (3r^t, with 0 < r < R, and 
let us compute the min-plus deconvolution of yj.,b by Pr,t- We have that 

iTr,b 0 PR,T)it) 

= sup {7r,(>(t -h u) - i?[u - T]+} 

u>0 

= sup {jr, bit + u) — R[u — T]~^{ \/ sup {jr, bit + u) — R[u — T]~^{ 

0<u<T u>T 

= sup {-Jr, bit + u)} V sup {-Jr, bit + u) — Ru RT} 

0<u<T ’ u>T 

= ilr, bit + T)} V sup {-Jr, bit + u) - Ru RT} . (3.16) 

u>T 

Let us first compute this expression for t < —T. Then 'Yr,bit -h T) = 0 and (3.16) 
becomes 

iTr,b 0 PR,T)it) 

= 0 V sup {'Jr,bit + u) — Ru -\- RT} 

T<_u<—t 

V sup {yr,bit + u) — Ru + RT{ 

u>—t 

= 0 V sup {0 — Ru RT} V sup {b r(t u) — Ru RT} 

T<u<—t u>—t 

= OV OV {b-\- Rt-\- RT} = [b-\- R(t -\- T)]+ . 

Let us next compute {'jr,b 0 PR,r)it) for t > —T. Then (3.16) becomes 

iTr,b C) PR,T)it) = {b r it -\-T)}\/ sup {b r it u) - Ru-\- RT} 

u>T 

= {6 -b rit -b T)} V {6 -b rit -b T)} = 6 -b r(f -b T). 

The result is shown in Ligure 3.8. 

Let us now state some properties of 0 (Other properties will be given in the next 
section). 

Theorem 3.1.12 (Properties of 0 ). Let f,g,h£ IF. 

• Rule 11 (Isotonicity of(d) Iff < g, then f (Z) h < g (Z) h and h(Z f >h(Z g. 
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Figure 3.8; Function 7 ^,?, 0 (3r^t when 0 < r < R. 



• Rule 12 (Composition of (Z)) (f (Z) g) (Z) h = f (Z) (g ^ h). 

• Rule 13 (Composition of Z and 0 ) (/ 0 g) C) g < f Z (g Z g). 

• Rule 14 (Duality between 0 and Z) f Z g < h if and only iff < g Z h. 

• Rule 15 (Self-deconvolution) (/ 0 /) is a sub -additive function of T such 
that (/ 0 /)(0) = 0. 

Proof: (Rule 1 1) If / < g, then for any h G IF 

(/ 0 /i)(f) = sup {/(f + u) - /i(u)} < sup {g(t -1- u) - h(u)} = (g Z h)(t) 

u>0 u>0 

(h Z f){t) = sup {h(t -1- u) - f(u)} > sup {h(t -1- u) - g(u)} = (h Z g){t). 

u>0 u>0 



(Rule 12) One computes that 

i{fZg)Zh){t) = sup {{f Z g){t -1- u) - h(u)} 



u>0 



= sup I sup {f (t -\- u -\- v) - g(v)} - h{u)\ 

u>o Lt)>o j 

= sup-^ sup {/(f + r;') 1 

n>0 j 

= sup sup {/(f + r;') - -m) + /i(m)}} 

n>0 v'>u 

= sup sup {f(t-\-v') - {g(v' -u) -\- h(u)}} 

v' >0 0<u<v' 

= sup |/(f + r;') — inf {g(v' — u) -\- h(u)}\ 
v'>0 [ 0<u<v' J 

= sup {fit + v') -{gZ h)(v')} = ifz(gZ h))(t). 

v' >0 
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(Rule 13) One computes that 

= sup{{f <gi g){t -\-u) - g{u)} 

u>0 

= sup inf {f{t-\-u-s)-\-g{s) -g{u)} 

u>0 0<s<t-\-u 

= sup ini {f{t- s') -\-g{s'-\-u)-g{u)} 

n>0 

< sup inf {f{t - s') -i g{s' -iu) - g{u)} 

u>oO<s'<t 

< sup inf \f{t-s')-\-sup{g{s'-\-v)-g{v)}\ 

( v>0 ) 

= |/(*-s') + sup{5f(s'+i;)-5f(i;)}| 

o<s'<t [ „>o J 

= ~ + (a ^ a)(s')} = if ^ (g C) g))(t). 

0<s' <t 

(Rule 14) Suppose first that (/ 0 g){s) < h{s) for all s. Take any s, r; > 0. Then 
/(s + r;) - g{v) < sup {/(s + u) - g{u)} = (/ 0 g){s) < h{s) 

u>0 

or equivalently, 

f{s+v) < g{v) +h{s). 

Let t = s V. The former inequality can be written as 

f{t) < g{t - s) + h{s). 

As it is verified for all f > s > 0, it is also verihed in particular for the value of 
s that achieves the inhmum of the right-hand side of this inequality. Therefore it is 
equivalent to 

fit) < „inf {g{t - s) -h h{s)} = (g 'gi h){t) 

0<s<t 

for all f > 0. Suppose now that for all v, f{v) < (<jf 0 h){v). Pick any t G M.. Then, 
since g,h € IF, 

f{v) < inf {g{v - s) -h h{s)} = inf {g{v - s) -h h{s)} < g{t - v) -\- h{t). 

0<s<v 

Let u = t — V, the former inequality can be written as 

fit-iu) - g{u) < h{t). 

As this is true for all u, it is also verified in particular for the value of u that achieves 
the supremum of the left-hand side of this inequality. Therefore it is equivalent to 

sup {fit + u)- giu)} < hit). 

ueK 
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Now if u < 0, g{u) = 0, so that sup^^o{/(^ + u) — g{u)} = f{t) and the former 
inequality is identical to 

sup {f{t + u) - g{u)} < h{t) 

u>0 

for all t. (Rule 15) It is immediate to check that (/ 0 /)(0) = 0 and that / 0 / is 
wide-sense increasing. Now, 

(/0/)(s) + (/0/)W 

= sup {/(f + u) - f{u)} + sup {/(s + u) - f{v)} 

U>0 V>0 

= sup {f{t + u) - f{u)} + sup {/(s + f + w) - /(f + w)} 

n>0 w>—t 

> sup |sup{/(f + u) - f(u) +/(s + f + w) - /(f + w)} I 

tfi>o Ln>o j 

> sup {f{t + w) - f{w) + /(s + f + w) - f{t + w)} 

= (/ 0 /)(« + *)• 



□ 

Let us conclude this section by a special property that applies to self-deconvolution 
of sub-additive functions. 

Theorem 3.1.13 (Self-deconvolution of sub-additive functions). Let f e T. Then 
/(O) = 0 and f is sub-additive if and only if f (d f = f. 

Proof: {^) If / is sub-additive, then for all t,u > 0, f{t -\-u) — f{u) < f(f) and 

therefore for alH > 0, 

(/ 0 /)(i) = sup {f{t -I- u) - f{u)} < f{f). 

ii>0 

On the other hand, if /(O) = 0, 

(/ 0 /)(i) = sup {f{t -I- u) - f{u)} > f{f) - /(O) = f{f). 

ii>0 

Combining both equations, we get that / 0 / = /. (<:=) Suppose now that / 0 / = /. 
Then/(0) = (/0/)(O) = 0 and for any f, u > 0,/(f) = {f <Z) f){f) > f{t + u) - 
f{u) so that f{t) -h f{u) > /(f -h u), which shows that / is sub-additive. □ 

3.1.10 Representation of Min-plus Deconvolution by Time In- 
version 

Min-plus deconvolution can be represented in the time inverted domain by min- 
plus convolution, for functions that have a hnite lifetime. Function g G Q has a 
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finite lifetime if there exist some finite Tq and T such that g{t) = 0 if f < Tq and 
g{t) = g{T) for f > T. Call Q the subset of Q, which contains functions having a 
hnite lifetime. For function g G we use the notation gi(+oo) as a shorthand for 
suPigR{fif(f)} = limi^+oo g{t)- 



Lemma 3.1.1. Let f & IF be such that limj_j._|_oo /(f) = +oo. For any g & G, gC) f 
is also in Q and (g 0/)(+oo) = ai+oo). 



Proof: Define L = gi(+oo) and call T a number such that g{t) = L for t > T. 

/(O) > 0 implies that g (d f < g(-\-oo) = g(L). Thus 

ig(2) f){t) <LIort>T. (3.17) 

Now since limj_j._|_oo /(f) = +oo, there is some T\ > T such that f(t) > L for 
all t > T\. Now let t > 2T\. \fu > T\, then f{u) > L. Otherwise, u <T\ thus 
t — u > t — Ti >Ti thus g(t — u) > L. Thus in all cases f{u) + g{t — u) > L. 
Thus we have shown that 

{g^ f){t)>L forty 2Ti. (3.18) 

Combining (3.17) and (3.18) shows the lemma. □ 

Definition 3.1.14 (Time Inversion). For a fixed T e [0, +oo[, the inversion opera- 
tor « defined on Q by: 

^T{f){g) = g{+oo) - g{T - t) 

Graphically, time inversion can be obtained by a rotation of 180 ° around the 
point ( Y, )_ It is simple to check that $t(<i) is in Q, that time inversion is sym- 

metrical ($t($t(<i)) = g) and preserves the total value ($t(<i)(+oo) = g{-foo)). 
Lastly, for any a and T, a is an arrival curve for g if and only if a is an arrival curve 
for$T(fl')- 

Theorem 3.1.14 (Representation of Deconvolution by Time Inversion). Let g e 

0, and let T be such that g{T) = g{-\-oo). Let f £ F be such that limj_)._|_oo /(f) = 
-boo. Then 

g(Z>f = ^Ti^T(g)^f) (3.19) 



The theorem says that g (d f can be computed by first inverting time, then com- 
puting the min-plus convolution between /, and the time-inverted function g, and 
then inverting time again. Figure 3.9 shows a graphical illustration. 





m T 



Figure 3.9: Representation of the min-plus deconvolution of gi by / = 7^,/ 
by time-inversion. From top to bottom: functions / and g, function ^r{g) 
function ^rig) ® / and finally function g<Z> f = $T($T(y) ® /)• 
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Proof: The proof consists in computing the right handside in Equation (3.19). 

Call g = We have, by definition of the inversion 

^Ti^rig) ® /) = ^rig ® /) = (5 ® /){+oo) - (5 ® /)(T - t) 

Now from Lemma 3.1.1 and the preservation of total value: 

{g ® /)(+oo) = ^(+ 00 ) = fif(+oo) 

Thus, the right-handside in Equation (3.19) is equal to 

g{+oo) - (5 ® f){T -t)= fif(+oo) - inf {^(T - t - u) -\- f{u)} 

u>0 

Again by definition of the inversion, it is equal to 

fif(+oo) - inf {gf(+oo) - g{t + u) + f{u)} = sup{g{t + u) - f{u)}. 



3.1.11 Vertical and Horizontal Deviations 

The deconvolution operator allows to easily express two very important quantities in 
network calculus, which are the maximal vertical and horizontal deviations between 
the graphs of two curves / and g of JE. The mathematical definition of these two 
quantities is as follows. 

Definition 3.1.15 (Vertical and horizontal deviations). Let f and g be two func- 
tions or sequences of T. The vertical deviation v{f,g) and horizontal deviation 
h( f, g) are defined as 

v(f,g) = sup{f(t) - g(t)} (3.20) 

i>0 

h{f,g) = sup {inf {d > 0 such that /(t) < (ji(t + d)}} . (3.21) 

i>0 

Eigure 3.10 illustrates these two quantities on an example. 

Note that (3.20) can be recast as 

v{f,g) = {f<Zgm (3.22) 

whereas (3.20) is equivalent to requiring that h{f, g) is the smallest d >0 such that 
for alH > 0, /(f) < g{t d) and can therefore be recast as 

h{f, g) = inf {d>0 such that (/ 0 g){—d) < 0} . 

Now the horizontal deviation can be more easily computed from the pseudo-inverse 
of g. Indeed, Definition 3.1.7 yields that 
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Figure 3.10: The horizontal and vertical deviations between functions / and 
9- 



9 = inf {A such that g{A) > f{t)} 

= inf {d > 0 such that g(t + d) > f(t)} + t 

so that (3.21) can be expressed as 

h{f,g) = sup {g-\f{t)) - t} = {g~\f) 0Ai)(O). (3.23) 

i>0 

We have therefore the following expression of the horizontal deviation between / 
and g: 

Proposition 3.1.1 (Horizontal deviation). 

h{f,g) = sup {g-\f{t)) -t} . 
t>o 

3.2 Max-plus Calculus 

Similar definitions, leading to similar properties, can be derived if we replace the 
infimum (or minimum, it is exists) by a supremum (or maximum, if it exists). 
We use the notation V for denoting sup or max. In particular, one can show that 
(KU {— oo}, V, +) is also a dioid, and construct a max-plus convolution and decon- 
volution, which are defined as follows. 

3.2.1 Max-plus Convolution and Deconvolution 

Definition 3.2.1 (Max-plus convolution). Let f and g be two functions or se- 
quences of T. The max-plus convolution of f and g is the function 

{f^9){t)= sup {/(t-s) +fif(s)}. 

0<s<i 



(3.24) 
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(Ift < 0, = Oj. 

Definition 3.2.2 (Max-plus deconvolution). Let f and g be two functions or se- 
quences of IF. The max-plus deconvolution of f by g is the function 

= inf {fit + u) - g(u)} . (3.25) 



3.2.2 Linearity of Min-plus Deconvolution in Max-plus Algebra 

Min-plus deconvolution is, in fact, an operation that is linear in (K+ , V, -I-). Indeed, 
one easily shows the following property. 

Theorem 3.2.1 (Linearity of 0 in max-plus algebra). Let f,g,h £ IF. 

• Rule 16 (Distributivity of (Z) with respect toy) (fyg)(dh = [f C)h)y {gZh). 

• Rule 17 (Addition of a constant) For any K € if+K)(Z)g = if(Zg)+K. 

Min-plus convolution is not, however, a linear operation in (K+ , V, -I-), because 
in general 

(/ V fif) 0 /i ^ (/ 0 /i) V (fif 0 h). 

Indeed, take / = ()zr,t, g = and h = \ 2 r for some R,T > 0. Then using 
Rule 9, one easily computes (see Figure 3.11) that 



f ®h 
g 

ifyg)®h 



= /?3i?,T 0 Mr = 02R,T 

= Aij 0 X 2 R = Xr 

= i(izR,T V Xr) 0 X 2 R = (i 2 R,ZT/i V Xr 
MR,ry Xr = if ®h)y ig®h). 



Conversely, we have seen that min-plus convolution is a linear operation in (K + , A , -I-) , 
and one easily shows that min-plus deconvolution is not linear in (K+ , A, -I-). Fi- 
nally, let us mention that one can also replace -I- by A, and show that (K U {-boo} U 
{— 00 }, V, a) is also a dioid. Remark However, as we have seen above, as soon as 
the three operations A, V and -\- are involved in a computation, one must be careful 
before applying any distribution. 



3.3 Exercises 

Exercise 3.1. 1. Compute a 0 6 for any function a 

2. Express the rate-latency function by means of 5 and X functions. 
Exercise 3.2. 7. Compute when (Hi is a rate-latency function 

2. Compute 0 M with /3i(f) = R(t — T)~^ and Mit) = {ft + 6)l|j>o} 



Exercise 3.3. 7. Is 0 distributive with respect to the min operator ? 
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Figure 3.11: Function (/ ® /i) V (gi ® h) (left) and {f V g) ^ h (right) when 
/ = /Jsfl.T, = Aij and /i = X 2 R for some R,T > 0. 




Chapter 4 



Min-plus and Max-plus System 
Theory 



In Chapter 3 we have introduced the basic operations to manipulate functions and 
sequences in Min-Plus or Max-Plus algebra. We have studied in detail the opera- 
tions of convolution, deconvolution and sub-additive closure. These notions form 
the mathematical cornerstone on which a first course of network calculus has to be 
built. 

In this chapter, we move one step further, and introduce the theoretical tools to 
solve more advanced problems in network calculus developed in the second half 
of the book. The core object in Chapter 3 wsvt functions and sequences on which 
operations could be performed. We will now place ourselves at the level of operators 
mapping an input function (or sequence) to an output function or sequence. Max- 
plus system theory is developed in detail in [24], here we focus on the results that 
are needed for the remaining chapters of the book. As in Chapter 3, we focus here 
Min-Plus System Theory, as Max-Plus System Theory follows easily by replacing 
minimum by maximum, and inhmum by supremum. 



4.1 Min-plus and Max-plus Operators 

4.1.1 Vector Notations 

Up to now, we have only worked with scalar operations on scalar functions in T or 
Q. In this chapter, we will also work with vectors and matrices. The operations are 
extended in a straightforward manner. 

Let J be a hnite, positive integer. For vectors z, z' € ^ , we define i’A T' as 

the coordinate-wise minimum of z and z', and similarly for the -|- operator. We write 
z < z' with the meaning that zj < Zj for 1 < j < J. Note that the comparison 
so defined is not a total order, that is, we cannot guarantee that either z < z' or 



J.-Y. Le Boudec and P. Thiran, Network Calculus, LNCS 2050, pp. 159—182, 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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z' < z holds. For a constant K, we note z -i- K the vector defined by adding K to 
all elements of z. 

We denote by the set of J-dimensional wide-sense increasing real-valued 
functions or sequences of parameter t, and the subset of functions that are zero 
fori < 0. 

For sequences or functions x{t), we note similarly (x A y){t) = x{t) A y{t) 
and {x -\- K)(t) = x{t) -\- K for all f > 0, and write x < y with the meaning that 
x{t) < y{t) for all t. 

For matrices A,B G ^ x K+ ^ , we define AABa& the entry-wise minimum 
of A and B. For vector z € , the ‘multiplication’ of vector z € ^ by matrix 

A is - remember that in min-plus algebra, multiplication is the -|- operation - by 

A-\- z, 

and has entries mini<j<j(ay -I- Zj). Likewise, the ‘product’ of two matrices A and 
B is denoted hy A-\- B and has entries mini<j<j(ay -I- bjk) for 1 < i,k < J. 

Here is an example of a ‘multiplication’ of a vector by a matrix, when J = 2 




and an example of a matrix ‘multiplication’ is 

5 3 1 [ 2 4 1 _ [ 4 3 

1 3j'''[l oJ“[3 3 



j2 

We denote by T the set of J x J matrices whose entries are functions or 
sequences of T, and similarly for . 

The min-plus convolution of a matrix A ^ by a vector z € is the vector 
of defined by 

(A^z)(t)= inf {A(f — s) -b £( 5 )} 

0<s<t 



and whose J coordinates are thus 



min {oij ® Zj){t) = inf min {oy (f - s) -b Zj{s)}. 

1<J<J 

Likewise, A^ B is defined by 

{A'g)B){t)= inf {A(f — s) -b H(s)} 

0<s<t 

and has entries mini<j<j(ay ® bjk) for 1 < i,k < J. 

For example, we have 

00 ^ 7r/2,() _ K/^Tr/2,b 

00 St S 2 T S^T 
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and 



00 


6d 


lrl2,b 


^r,b 




^ Tr/2,& 




00 St 




S 2 T 






SzT 


Pr,T 



Finally, we will also need to extend the set of wide-sense increasing functions 
Q to include non decreasing functions of two arguments. We adopt the following 
definition (a slightly different definition can be found in [10]). 

Definition 4.1.1 (Bivariate wide-sense increasing functions). We denote by Q the 
set of bivariate functions (or sequences) such that for all s' < s and any t < t' 

< f{t,s') 

f{t,s) < f{t',s). 

We call such functions bivariate wide-sense increasing functions. 

In the multi-dimensional case, we denote by the set of J x J matrices whose 
entries are wide-sense increasing bivariate functions. A matrix of A{t) £ is a 
particular case of a matrix H {t, s) € , with s set to a fixed value. 

4.1.2 Operators 

A system is an operator II mapping an input function or sequence x onto an output 
function or sequence y = II (x). We will always assume in this book that x,y € , 

where J is a fixed, finite, positive integer. This means that each of the J coordinates 
1 < j < J, is a wide-sense increasing function (or sequence) of t. 

It is important to mention that Min-plus system theory applies to more general 
operators, taking to , where neither the input nor the output functions are 
required to be wide-sense increasing. This requires minor modifications in the defi- 
nitions and properties established in this chapter, see [24] for the theory described in 
a more general setting. In this book, to avoid the unnecessary overhead of new no- 
tations and definitions, we decided to expose min-plus system theory for operators 
taking to . 

Most often, the only operator whose output may not be in JF is deconvolution, 
but all other operators we need will take to . 

Most of the time, the dimension of the input and output is J = 1, and the 
operator takes T to T . We will speak of a scalar operator. In this case, we will drop 
the arrow on the input and output, and write y = II (a;) instead. 

We write Hi < II 2 with the meaning that IIi(£) < II 2 (;e) for all x, which in 
turn has the meaning that II 1 (£) (f) < II 2 (;r) (f) for all t. 

For a set of operators D^, indexed by s in some set S, we call inf^gs the 
operator defined by [inf^gs Il 5 ](a;(f)) = inf 5 gs[n 5 (a;(f))]. For S = {1,2} we 
denote it with Hi A II 2 . 

We also denote by o the composition of two operators: 

(Hi on2)(f) = ni(n2(f)). 

We leave it to the alert reader to check that inf ^gs and Hi o II 2 do map functions 
in to functions in . 
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4.1.3 A Catalog of Operators 

Let us mention a few examples of scalar operators of particular interest. The first two 
have already been studied in detail in Chapter 3, whereas the third was introduced in 
Section 1.7. The fact that these operators map into follows from Chapter 3. 

Definition 4.1.2 (Min-plus convolution C^)- 

Cct : J- J- 

x{t) y{t) = Ca{x){t) = {a igi x){t) - s) -\- x{s)} , 

for some <7 ^ IF. 

Definition 4.1.3 (Min-plus deconvolution T>„). 

V, ■. F g 

x(t) y(t) =V^(x)(t) = (x(Z)a)(t) = sup^yo{x(t -\-u) - a(u)} , 

for some a G F. 

Note that Min-plus deconvolution produces an output that does not always be- 
long to F. 

Definition 4.1.4 (Packetization Vl)- 

Vl ■ F ^ F 

x{f) y{f) = VL{x){f) = P^{x{f)) = infjgN {L{i)'^L{i+i)>x] , 

for some wide-sense increasing sequence L (defined by Definition 1.7.1). 

We will also need later on the following operator, whose name will be justified 
later in this chapter. 

Definition 4.1.5 (Linear idempotent operator h„). 

h(j : F F 

x(t) y(t) = h„{x){t) = info< 5 <i - a(s) -t- a;(s)} , 

for some o ^ F. 

The extension of the scalar operators to the vector case is straightforward. The 
vector extension of the convolution is for instance: 

Definition 4.1.6 (Vector min-plus convolution Cs). 

Cy. ■. F^ ^ F^ 

x{f) y(t) = Cs(f)(t) = (E ® :r)(t) = info<5<i {S(t - s) + :r(s)} , 



for some E € F^ . 
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If the {i, j)th entry of E is (Ty , the ith component of y{t) reads therefore 
yi{t) = ml min - s) -\- Xj{s)} 

0<s<t l<j<t7 

Let us conclude with the shift operator, which we directly introduce in the vector 
setting: 

Definition 4.1.7 (Shift operator St)- 

St ■. gJ ^ gJ 

x{t) y{t) = ST{x){t) = x(t - T), 

for some T € IR. 

Let us remark that iSq is the identity operator: iSq {x) = x. 



4.1.4 Upper and Lower Semi-continuous Operators 

We now study a number of properties of min-plus linear operators. We begin with 
that of upper-semi continuity. 

Definition 4.1.8 (Upper semi-continuous operator). Operator II is upper semi- 
continuous if for any (finite or infinite) set of functions or sequences {xn}, Xn G 

n (inf{f„}') = inf {n(f„)} . (4.1) 

We can check that Cs, and St are upper semi-continuous. For example, 
for Cs , we check indeed that 

Cs ^inf{f„}j (t) 



= inf inf {E(f — s) -h £„(s)} 

0<s<t n 

= inf inf {E(f — s) -|- £„(s)} 

n 0<s<t 

= inf{Cs(f„)(t)}. 

n 



Likewise, noting that L{i -h 1) < inf„gN{a;„} if and only if L{i -|- 1) < a;„ for 
all n G N, we get that 

l{L(*-|-l)<inf„eH{®n}} = inf l|£(j_|_i)<3;^} 



and thus we get that Vl ts upper semi-continuous: 
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Vl 



sup l{L(i+l)<inf„ {ajn } } 

*GN 

sup |inf {i(*)l{L(*+l)<a:„}}| 
mf{VL{Xn)}- 

n 



On the other hand, D^ is not upper semi-continuous, because its application to 
an inf would involve the three operations sup, inf and -I-, which do not commute, as 
we have seen at the end of the previous chapter. 

It is easy to show that if Hi and II 2 are upper semi-continuous, so are Hi A II 2 
and Hi o II 2 . 

The dual definition of upper semi-continuity is that of lower semi-continuity, 
which is defined as follows. 

Definition 4.1.9 (Lower semi-continnons operator). Operator II is lower semi- 
continuous if for any (finite or infinite) set of functions or sequences {xn}, Xn G , 

n ( sup{f„} ] = sup {n(f„)} . (4.2) 

\ n J n 



It is easy to check that is lower semi-continuous, unlike other operators, 
except St which is also lower semi-continuous. 



4.1.5 Isotone Operators 

Definition 4.1.10 (Isotone operator). Operator II is isotone if x\ < X 2 always 
implies II(£i) < II(£ 2 )- 

All upper semi-continuous operators are isotone. Indeed, if xi < X 2 , then xi A 
X 2 = xi and since II is upper semi-continuous, 

n(£i) = n(£i A X 2 ) = n(£i) a u{x2) < h{x2)- 

Likewise, all lower semi-continuous operators are isotone. Indeed, if xi < X 2 , 
then xiV X 2 = X 2 and since II is lower semi-continuous, 

n(£i) < n(£i) V fi{x2) = n(£i v X2) = ff{x2)- 
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4.1.6 Linear Operators 



In classical system theory on (K, +, x), a system II is linear if its output 
combination of inputs is the linear combination of the outputs to each 
input. In other words, II is linear if for any (finite or infinite) set of inputs 
for any constant k G M., 




to a linear 
particular 
{xi}, and 



and for any input x and any constant k G W, 



U{k ■ x) = k ■ n(a;). 



The extension to min-plus system theory is straightforward. The first property 
being replaced by that of upper semi-continuity, a min-plus linear operator is thus 
defined as an upper semi-continuous operator that has the following property (“mul- 
tiplication” by a constant): 

Definition 4.1.11 (Min-plns linear operator). Operator II is min-plus linear if it 
is upper semi-continuous and if for any x & and for any k > 0, 

U{x-\-k) =U{x) -\-k. (4.3) 



One can easily check that C^, Cs, and St are min-plus linear, unlike "D^ and 
Vl- E>fj is not linear because it is not upper semi-continuous, and is not linear 
because it fails to verify (4.3). 

In classical linear theory, a linear system is represented by its impulse response 
h{t,s), which is defined as the output of the system when the input is the Dirac 
function. The output of such a system can be expressed as 

/ OO 

h(t, s)x(s)ds 

-OO 



Its straightforward extension in Min-plus system theory is provided by the following 
theorem [24]. To prove this theorem in the vector case, we need first to extend the 
burst delay function introduced in Definition 3.1.2, to allow negative values of the 
delay, namely, the value T in 




iff < T 
iff > T, 



is now taking values in K. We also introduce the following matrix D t^q-’ X g-’. 



Definition 4.1.12 (Shift matrix). The shift matrix is defined by 





Srit) 


OO 


OO 


OO 




OO 


Srit) 


OO 




DtH) = 


OO 


OO 


ST(t) 





: : OO 

OO ... OO St (t) 
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for some T £ M. 



Theorem 4.1.1 (Min-plus impulse response). II is a min-plus linear operator if 
and only if there is a unique matrix H £ Q'^ (called the impulse response), such 
that for any x € and any i € IR, 

n(£)(i) = inf s) -I- £(s)} . (4.4) 

sGR 



Proof: If (4.4) holds, one immediately sees that II is upper semi-continuous and 

verifies (4.3), and therefore is min-plus linear. II maps to Q'^ because H £ . 

Suppose next that II is min-plus linear, and let us prove that there is a unique 
matrix H{t, s) € such that (4.4) holds. 

Let us first note that Dg{t) -t- x{s) = x{s) for any s >t. Since x G ,we have 

inf {Dg{t) -\- ^(s)} = inf {^(s)} = x(t). 

S>t S>t 

On the other hand, all entries of Dg{t) are infinite for s < t. We have therefore that 

inf {Dg(t) -\- ^(s)} = 00 

S<t 

We can combine these two expressions as 

x(t) = inf {Dg(t) -\- ^(s)| , 

sGM 

or, dropping explicit dependence on t, 

X = inf iDg -b ;e(s)| . 

sGM 

Let dgj denote the jth column of Dg-. 



00 

00 
00 

00 

where Ss is located at the jth position in this vector. Using repeatedly the fact IT is 
min-plus linear, we get that 
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Defining 



where 



n(f) = n inf{D^+f(s)} 

\^sGR 

= inf {n + f(s))} 

sGR 



inf 

sGR 



inf < min llT f. 
5GK ^ V 



ds,j +Xj(s) 



H(t,s) = ^hi(t,s) ... hj(t,s) ... hj(t,s) 



hj(t,s) = n (^4i) (t) 

for all f e IR, we obtain therefore that 

n(f)(f) = inf i mm I hj(t,s) 4- Xj(s)\ \ = inf {i7(f, s) + f(s)} . 



(4.5) 

(4.6) 



We still have to check that H (t, s) £ . Since for any fixed s, IT j G , we 

have that for any t <t' 

hj(t,s) = n (^dsj^ (t) < n (^dsj^ (f) = hj(t',s), 

hence H{t,s) < On the other hand, if s' < s, one easily check that 

ds,j < ds'j. Therefore, since II is isotone (because it is linear and thus upper semi- 
continuous). 



hj{t,s) = n (f) < n (f) = hj{t,s') 

and therefore H{t, s) < H{t, s') for any s > s' . This shows that H{t, s) € Q'' . 

To prove uniqueness, suppose that there is another matrix H' £ Q'^ that satisfies 
(4.4), and let h' j denote its jth column. Then for any u £ M. and any 1 < j < J, 
taking x = duj as the input, we get from (4.6) that for f € IR 

hj(t,u) = n (f) = inf 1 77' (f,s) -I- c4j(s) I 

= inf lh'j{t,s) -l-()u(s)| = inf lh'j{t,s)\ = h'j{t,u). 

I J s<u I J 



Therefore H' = H. □ 

We will denote a general min-plus linear operator whose impulse response is H 
by Ch- In other words, we have that 
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CH{x){t) = inf {H{t,s) +£(s)}. 



One can compute that the impulse response corresponding to C s is 



to is 



and to St is 



««.») = { 



if s < f 
if s > f ’ 



if s < f 
if s > f ’ 



H{t, s) = Dxit — s). 



In fact the introduction of the shift matrix allows us to write the shift operator as a 
min-plus convolution: St = Cdt > 0. 

Let us now compute the impulse response of the compostion of two min-plus 
linear operators. 



Theorem 4.1.2 (Composition of min-plus linear operators). Let Ch and Ch' be 

two min-plus linear operators. Then their composition Ch ° Ch> is also min-plus 
linear, and its impulse repsonse denoted by H o H' is given by 



{HoH'){t,s) 



inf {H{t,u) -\- H'{u, s)} . 



Proof: The composition Ch ° Ch’ applied to some £ € is 

CH{CH'{x)){t) = inf u) -I- inf s) -I- £(s)}| 

= inf mf {H{t,u) -\- H' {u, s) -\- x{s)} 

U S 

= inf |inf s) -I- s)} -I- £(s)| . 

s K u ) 

□ 

We can therefore write 

Ch ° Ch' = ChoH' ■ 

Likewise, one easily shows that 

Ch a Ch' = Chah' ■ 

Finally, let us mention the dual definition of a max-plus linear operator. 

Definition 4.1.13 (Max-plus linear operator). Operator II is max-plus linear if it 
is lower semi-continuous and if for any x G and for any k >0, 



H{x -\-k) = n (£) -I- k. 



(4.7) 
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Max-plus linear operators can also be represented by their impulse response. 

Theorem 4.1.3 (Max-plus impulse response). II is a max-plus linear operator if 
and only if there is a unique matrix H £ {called the impulse response), such 
that for any x € and any i € IR, 

n(£)(i) = sup s) -b £(s)} . (4.8) 

sGR 

One can easily check that "D^ and St are max-plus linear, unlike C^, and 
Vl. 

For example, "D^ (x) (t) can be written as 

Vfj{x){t) = sup{a;(i-|-u) — <t(u)} = sup{a;(s) — <t(s — i)} = sup{a;(s) — <t(s — i)} 

li>0 S>t 

which has the form (4.8) if H{t, s) = —a{s — f). 

Likewise, St (x) (t) can be written as 

St (x) (t) = x(t — T) = sup{£(s) — D-t(s — t)} 
sGM 

which has the form (4.8) if s) = —D-t{s — t). 



4.1.7 Causal Operators 

A system is causal if its output at time t only depends on its input before time t. 

Definition 4.1.14 (Causal operator). Operator II is causal if for any t, xi{s) = 
X2(s) for all s < t always implies II(£i)(f) = II(£2)(f)- 

Theorem 4.1.4 (Min-plus causal linear operator). A min-plus linear system with 
impulse response H is causal if H{t, s) = H (t, t) for s > t. 



Proof: If H{t, s) 

Xl,X2 G 

jCH(xi)(t) 



= 0 for s > t and if xi{s) = X2{s) for all s < f then since 

= inf -b £i(s)} 

sGM 

= inf {H(t, s) -b xi (s)} A inf {H(t, s) -b xi (s)} 

s<t s>t 

= inf {H(t, s) -b :ri(s)} A inf + ^i(s)} 

s<t s>t 

= inf s) -b ;ei(s)} 

S<t 

= inf s) -b ;E2 (s)} 

S<t 

= inf {H{t, s) -b :T2 (s)} A inf + ^2(s)} 

s<t s>t 

= inf {H{t, s) -b :T2 (s)} A inf {H{t, s) -b :?2(s)} 

s<t s>t 

= inf {H(t,s) -\- X2(s)} = £H(x2)(t). 
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□ 

Ca, Cs, /i(T and Vl are causal. St is causal if and only if T > 0. "D^ is not 
causal. Indeed if xi{s) = X 2 {s) for all s < t, but that xi{s) ^ X 2 {s) for all s > t, 
then 



'Da{xi){t) = sup {£i(f + u) - (t(u)} 
u>0 

^ sup {£2(i + w) - O-(u)} 

U>0 

= V^{xi){t) 

4.1.8 Shift-invariant Operators 

A system is shift-invariant, or time-invariant, if a shift of the input of T time units 
yields a shift of the output of T time units too. 

Definition 4.1.15 (Shift-invariant operator). Operator II is shift-invariant if it 
commutes with all shift operators, i.e. if for any x & Q and for any T € IR 

n(5T(f)) = 5T(n(f)). 

Theorem 4.1.5 (Shift-invariant min-plus linear operator). Let Ch and Ch' be 

two min-plus linear, shift-invariant operators. 

(i) A min-plus linear operator Ch E shift-invariant if and only if its impulse 
response H(t,s) depends only on the difference (t — s). 

(ii) Two min-plus linear, shift-invariant operators Ch and Ch> commute. If they 
are also causal, the impulse response of their composition is 

{HoH'){t,s)= inf {H{t-s-u)-\-H'{u)} = {H^H'){t-s). 

0<u<t—s 



Proof: (i) Let hj (t,s) and dsj (t) denote (respectively) the jth column of H{t,s) 

and of Dg{t). Note that ds,j{t) = Ss{do,j){t)- Then (4.6) yields that 

hj{t,s) = n (^dgj^ (t) = n (Sg{do,j)'j (t) 

= Sg (f) = {t- ,s) = hj{t - s,o) 

Therefore H {t, s) can be written as a function of a single variable H(t — s). 

(ii) Because of Theorem 4.1.2, the impulse response of Ch ° Ch’ is 

[H o H'){t, s) = inf {H(t, u) -h H'(u, s)} . 

U 

Since H{t,u) = H(t — u) and H'{u,s) = H'(u — s), and setting v = u — s, the 
latter can be written as 

{H o H'){t, s) = inf {H{t — u) -\- H'{u — s)} = inf {H{t — s — v) -\- H'{v)} . 

U V 
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Similarly, the impulse response of Ch' ° C.H can be written as 
{H' o H){t, s) = inf {H'{t — u) + H{u — s)} = inf {H{v) + H'{t — s — r;)} 

U V 

where this time we have setv = t — u. Both impulse responses are identical, which 
shows that the two operators commute. 

If they are causal, then their impulse response is infinite for t > s and the two 
previous relations become 

{HoH'){t,s) = {H'oH){t,s) = inf {H{t - s - v) + H' {v)} = {H®H'){t-s). 

Q<v<t 

□ 

Min-plus convolution Cs (including of course and St) is therefore shift- 
invariant. In fact, it follows from this theorem that the only min-plus linear, causal 
and shift-invariant operator is min-plus convolution. Therefore /i^ is not shift- 
invariant. 

Min-plus deconvolution is shift-invariant, as 
Vfj{ST{x)){t) = sup{iST(a;)(f -I- u) — (t(u)} = sup{a;(f -I- u — T) — (t(u)} 

ii>0 ii>0 

= (a; 0 a){t -T)= V,{x){t -T)=St (V,) (x)(t). 

Finally let us mention that is not shift-invariant. 

4.1.9 Idempotent Operators 

An idempotent operator is an operator whose composition with itself produces the 
same operator. 

Definition 4.1.16 (Idempotent operator). Operator II is idempotent if its self- 
composition is n, i.e. if 

non = n. 

We can easily check that /i^ and Vl are idempotent. If a is sub-additive, with 
(t( 0) = 0, then ° = C„, which shows that in this case, is idempotent too. 

The same applies to T>„ . 

4.2 Closure of an Operator 

By repeatedly composing a min-plus operator with itself, we obtain the closure of 
this operator. The formal definition is as follows. 

Definition 4.2.1 (Sub-additive closure of an operator). Let flbe a min-plus oper- 
ator taking . Denote n^") the operator obtained by composing n (n - 1) 

times with itself. By convention, n^*^) = iSo = Cdq, so n(i) = n, n(2) = non, etc. 
Then the sub-additive closure of If, denoted by n, is defined by 

n = 5o A n A (n o n) A (n o n o n) A . . . = inf { n(”) 

n>0 I 



(4.9) 
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In other words. 



n(£) = X A n(£) A n(n(£)) a . . . 

It is immediate to check that II does map functions in to functions in . 

The next theorem provides the impulse response of the suh-additive closure of 
a min-plus linear operator. It follows immediately from applying recursively Theo- 
rem 4.1.2. 

Theorem 4.2.1 (Sub-additive closure of a linear operator). The impulse response 
of Ch is 

H(t,s) = mi inf {H{t,ui) H{ui,U 2 ) + ■ ■ ■ + H{un, s)'\ . (4.10) 

nSN Un 



and Ch = C-fj. 

For a min-plus linear, shift-invariant and causal operator, (4.10) becomes 
H{t - s) 

= inf inf ~ ui) -t- H{u\ — U 2 ) + ■ ■ ■ + H(un — s)} 

nSN 5<lin 

= inf . . inf^ _ {H{t - s - vi) -\- H{vi - V 2 ) + ■ ■ ■ + H{vn)} 

nSN <t—s 



= inf{Ff(”i}(t-s) 



(4.11) 



where = H®H^...®H(n times, n > 1) and H^^^ = So. 

In particular, if all entries (Ty (f) of E(t) are sub-additive functions, we hnd that 

Ct, = Cs- 



In the scalar case, the closure of the min-plus convolution operator C ^ reduces 
to the min-plus convolution of the sub-additive closure of cr: 

C =C- 

If (T is a “good” function (i.e., a sub-additive function with (t( 0) = 0), then C „ = C^- 
The sub-additive closure of the idempotent operators and Vl are easy to 
compute too. Indeed, since h^ix) < x and Vl{x) < x, 

hfj hfj 



and 

Vl=Vl- 

The following result is easy to prove. We write II < II' to express that II(£) < 
n'(£) for all £ e . 

Theorem 4.2.2 (Sub-additive closure of an isotone operator). If II and II ' are 

two isotone operators, andH < II', then II < II'. 
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Finally, let us conclude this section by computing the closure of the minimum 
between two operators. 

Theorem 4.2.3 (Sub-additive closure of Hi A II 2 ). Let IIi,n 2 be two isotone 
operators taking . Then 

III A II2 = (III A iSo) o (II2 A iSo). ( 4 . 12 ) 



Proof: (i) Since iSq is the identity operator, 

III A II2 = (III ° >So) A (So o II2) 

> ((Hi A iSo) o 5 o) A (iSo o (II2 A iSq)) 

> ((Hi A iSo) o (II2 A iSo)) A ((Hi A iSq) o (II2 A iSq)) 

= (Hi A iSo) o (II2 A iSo). 

Since Hi andll2 are isotone, so are Hi AII2 and (Hi AiSo)o(Il2AiSo). Consequently, 
Theorem 4 . 2.2 yields that 

III A II 2 > (III A iSo) o (III A iSo). (4.13) 

(ii) Combining the two inequalities 

111 A Sq S III A II2 A Sq 

112 A Sq S III A II2 A Sq 



we get that 

(III A iSo) o (III A iSo) > (III A II2 A iSo) ° (IIi A II2 A iSq). ( 4 . 14 ) 

Let us show by induction that 

((Hi An 2 )A 5 o)^”^ = min {(Hi A n2)(*^ | . 

0<k<n I J 

Clearly, the claim holds for n = 0 , 1 . Suppose it is true up to some n € N. Then 



((Hi An2)A5o)(”+'^ 

= ((Hi A U2) A 5 o) o ((Hi A U2) A 

= ((Hi AD2) A 5 o) o (^^min^{(ni An2)(*)}j 

= ^(Di AD2) o^min | (Hi A 02)^*) | j A o ^min |(niAn2)(*^| 



= min 

l<Ai<n+l 



{(Hi An, 



A min 

0<k<n 



{(Hi An, 



= min 

0<Ai<n+l 






174 



CHAPTER 4. MIN-PLUS AND MAX-PLUS SYSTEM THEORY 



Therefore the claim holds for all n € N, and 



(((Hi An2)A5o)o((ni An2)A5o))(”^ = ((Hi An2)A5o)('”^ 

= min |(ni An 2 )(*)| . 
0<k<2n I J 



Consequently, 



(III A II2 A iSo) o (III A II2 A iSo) 



inf min |(niAn 2 )(*U 
nGN0<i;<2n I J 

i„f{(n,An,r} 



— Hi A II2 



and combining this result with (4.13) and (4.14), we get (4.12). □ 

If one of the two operators is an idempotent operator, we can simplify the previ- 
ous result a bit more. We will use the following corollary in Chapter 9. 

Corollary 4.2.1 (Sub-additive closure of II 1 A Hm)- Let Hi be an isotone operator 
taking T ^ T , and let M £ T. Then 

Hi A /iM = {Lm o Hi) o Hm- (4.15) 



Proof: Theorem 4.2.3 yields that 

Hi A /iM = (III A iSo) o hM (4.16) 

because Hm < >5o- The right hand side of (4.16) is the inf over all integers n of 

({Hi A5 o}o/im)^”^ 

which we can expand as 

{Hi A .So} oHm ° {Hi A 5 o) ohM ° {Hi A 5 o) o Hm- 

Since 

Hm ° {Hi ASo}ohM = {Lm °Hi o Hm} A Hm 
= {{Hm °Hi} ASo) o Hm 
= ^mm^ I (/iM ° III) I o /iM, 

the previous expression is equal to 

min ^ |(/iM ° IIi)^''^! o hM- 
Therefore we can rewrite the right hand side of (4.16) as 
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(III A .So) o hM 





inf I (/iM ° IIi)^''^| oHm = {hm ° IIi) o hm, 



which establishes (4.15). 

Therefore we can rewrite the right hand side of (4.16) as 

(Hi A 5o) o hM = inf < min | (Hm ° IIi)^®^ | o Hm \ 
kGN (0<9<n I J J 

= hM ° |(^Af ° IIi)^®^ I o hM = hM ° {hM ° III) ° hM, 



which establishes (4.15). □ 

The dual of super-additive closure is that of super-additive closure, defined as 
follows. 

Definition 4.2.2 (Super-additive closure of an operator). Let II be an operator 
taking . The super-additive closure ofU, denoted by II, is defined by 

n = 5o V n V (n o n) V (n o n o n) V . . . = sup I | . (4.i7) 

n>0 ^ J 



4.3 Fixed Point Equation (Space Method) 

4.3.1 Main Theorem 

We now have the tools to solve an important problem of network calculus, which has 
some analogy with ordinary differential equations in conventional system theory. 

The latter problem reads as follows: let II be an operator from to , and let 

a e . What is then the solution x{t) to the differential equation 

^(t)=U{x){t) (4.18) 

with the inital condition 

f(0) = a. (4.19) 

Here H is an operator taking , and a G . The problem is now to find 

the largest function x{t) € , which verifies the recursive inequality 

x{t) < U{x){t) (4.20) 

and the initial condition 

x(t) < a(t). (4.21) 

The differences are however important: first we have inequalities instead of 

equalities, and second, contrary to (4.18), (4.20) does not describe the evolution 
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of the trajectory x{t) with time t, starting from a fixed point a, but the successive 
iteration of II on the whole trajectory x{t), starting from a fixed, given function 
a(t) e . 

The following theorem provides the solution this problem, under weak, technical 
assumptions that are almost always met. 

Theorem 4.3.1 (Space method). Let II be an upper semi-continuous operator tak- 
ing . For any fixed function a £ , the problem 

X < a A n(£) (4.22) 

has one maximum solution in , given by x* = 11(a). 

The theorem is proven in [24]. We give here a direct proof that does not have the 
pre-requisites in [24]. It is based on a fixed point argument. We call the application 
of this theorem “Space method”, because the iterated variable is not time t (as in the 
“Time method” described shortly later) but the full sequence x itself. The theorem 
applies therefore indifferently whether f € Z or f € IR. 



Proof: (i) Let us first show that 11(a) is a solution of (4.22). Consider the sequence 

{£"} of decreasing sequences defined by 



xo = a 

Xn+l = XnAU(Xn), n > 0. 



Then one checks that 






is a solution to (4.22) because x* < xq 
so that 



inf 

n>0 

= d and because II is upper-semi-continuous 



n(£*) = n(inf {xn}) = inf {II(f„)} > inf > inf {f„} = x* . 

n>0 n>0 n>0 n>0 

Now, one easily checks that x„ = info<m<n{n^™^ (®)}> 

= mf{f4 = inf inf {n(™)(a)| = mf {n(”) (a)} = n(a). 

n>0 n>0 0<m<n n>0 

This also shows that x* £ . 

(ii) Let £ be a solution of (4.22). Then x < a and since II is isotone, II(£) < 11(a). 
From (4.22), x < II(£), so that x < 11(a)- Suppose that for some n > 1, we 
have shown that x < Ili"“^i(a). Then as £ < II(£) and as II is isotone, it yields 
that X < ni"i(a). Therefore x < inf„>o{Ili"i (a)} = 11(a), which shows that 
;?* = n(a) is the maximal solution. □ 

Similarly, we have the following result in Max-plus algebra. 

Theorem 4.3.2 (Dual space method). Let H be a lower semi-continuous operator 
taking . For any fixed function a £ Q'^ , the problem 

X > dv n(£) 

has one minimum solution, given by x* = 11(a). 



(4.23) 
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4.3.2 Examples of Application 

Let us now apply this theorem to five particular examples. We will first revisit the 
input-output characterization of the greedy shaper of Section 1.5.2, and of the vari- 
able capacity node described at the end of Section 1.3.2. Next we will apply it to 
two window flow control problems (with a fixed length window). Finally, we will 
revisit the variable length packet greedy shaper of Section 1.7.4. 

Input-Output Characterization of Greedy Shapers 

Remember that a greedy shaper is a system that delays input bits in a buffer, when- 
ever sending a bit would violate the constraint a, but outputs them as soon as possi- 
ble otherwise. If R is the input flow, the output is thus the maximal function x G IF 
satisfying the set of inequalities (1.13), which we can recast as 

X < R A Ca-(x). 

It is thus given by R* = Ca-(x) = C-^(x) = W (g> x. If cr is a “good” function, one 
therefore retrieves the main result of Theorem 1.5.1. 



Input-Output Characterization of Variable Capacity Nodes 



The variable capacity node was introduced at the end of Section 1.3.2, where the 
variable capacity is modeled by a cumulative function M (t), where M (t) is the total 
capacity available to the flow between times 0 and t. If m{t) is the instantaneous 
capacity available to the flow at time t, then M (t) is the primitive of this function. 
In other words, if f € IR, 

M{t) = [ m{s)ds (4.24) 



Jo 

and if f e Z the integral is replaced by a sum on s. If i? is the input flow and x is 
the output flow of the variable capacity node, then the variable capacity constraint 
imposes that for all 0 < s < f 



x{t) — x(s) < M{t) — M{s), 



which we can recast using the idempotent operator /i m as 

X < hM{x)- (4.25) 

On the other hand, the system is causal, so that 

x<R. (4.26) 



The output of the variable capacity node is therefore the maximal solution of system 
(4.25) and (4.26). It is thus given by 

R*(t) = hM(R)(t) = hM(R)(t) = inf {M(t) - M(s) + R(s)} 

0<s<t 

because the sub-additive closure of an idempotent operator is the operator itself, as 
we have seen in the previous section. 
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Static window flow control - example 1 

Let us now consider an example of a feedback system. This example is found inde- 
pendently in [9] and [60, 2], A data flow a{t) is fed via a window flow controller to 
a network offering a service curve /3. The window flow controller limits the amount 
of data admitted into the network in such a way that the total backlog is less than or 
equal to W, where W > 0 (the window size) is a fixed number (Figure 4.1). 



a 




Figure 4.1: Static window flow control, from [9] or [60] 



Call x{t) the flow admitted to the network, and y{t) the output. The definition 
of the controller means that x{t) is the maximum solution to 



f x(t) < a(t) 

{ x{t) < y(t) + W 



(4.27) 



We do not know the mapping II : x —>■ y = II (a;), but we assume that II is isotone, 
and we assume that y(t) > (/3 (g> x)(t), which can be recast as 



n(a;) > C/ 3 (x). 



(4.28) 



We also recast System (4.27) as 

x<aA {n(a;) 4- W} , (4.29) 



and direclty apply Theorem 4.3.1 to derive that the maximum solution is 

X = Jfr+W){a). 



Since II is isotone, so is II -|- W. Therefore, because of (4.28) and applying 
Theorem 4.2.2, we get that 

X = (nTW){a) > {Cp4-W){a). (4.30) 

Because of Theorem 4.2.1, 

(Cp -I W)(a) = Cp+w{a) = = (/3 -b IT) ® a. 

Combining this relationship with (4.30) we have that 

y>l3(»x>l3(» (WTW) ® a) = (/) ® (/) -b IT)) (a), 
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which shows that the complete, closed-loop system of Figure 4. 1 offers to flow a a 
service curve [9] 

/^wfci =P<»iO + W). (4.31) 

For example, if /3 = then the service curve of the closed-loop system is 
the function represented on Figure 4.2. When RT < W, the window does not add 
any restriction on the service guarantee offered by the open-loop system, as in this 
case /Jvvfci = /^- > FF on the other hand, the service curve is smaller than 

the open-loop service curve. 




Casel:RT<W Case 2: RT > W 



Figure 4.2: The service curve of the closed-loop system with static 

window flow control, when the service curve of the open loop system is i3r^t 
with RT <W (left) and RT>W (right). 



Static window flow control - example 2 

Let us extend the window flow control model to account for the existence of back- 
ground traffic, which constraints the input traffic rate at time t, dxjdt{t) (if t G W) 
or x{t) — x{t — 1) (if t e Z), to be less that some given rate m{t). Let M (t) denote 
the primitive of this prescribed rate function. Then the rate constraint on x becomes 
(4.25). Function M{t) is not known, but we assume that there is some function 
7 G JT such that 

M{t) — M{s) > 7 (f — s) 
for any 0 < s < f , which we can recast as 

/iM > C'j- (4.32) 

This is used in [41] to derive a service curve offered by the complete system to the 
incoming flow x, which we shall also compute now by applying Theorem 4.3.1. 

With the additional constraint (4.25), one has to compute the maximal solution 
of 
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X < a A {H{x) -\- W} A hM{x), (4.33) 

which is 

a; = ({U4-W}AhM)(a). (4.34) 

As in the previous subsection, we do not know II but we assume that it is isotone 
and that II > C^. We also know that Hm > Cj. A first approach to get a service 
curve for y, is to compute a lower bound of the right hand side of (4.34) by time- 
invariant linear operators, which commute as we have seen earlier in this chapter. 
We get 

{n -b kk} A Hm > {C(3 -PW} ACj = C{/3+w}a 75 
and therefore (4.34) becomes 

X > Cip+w}A'f(a) = ^ {/3+w}A7 (”) = i{l3 + W} Aj)^a. 

Because of Theorem 3.1.11, 

{/) -b Ik} A 7 = (/) -b Ik) ® 7 

so that 

y > (3 ® X > {j3 ® {(3 -P W) ® 7 ^ ® a 

and thus a service curve for flow a is 

I3 ® JpTw) ® 7 . (4.35) 

Unfortunately, this service curve can be quite useless. For example, if for some 
T > 0, 7 (f) = 0 for 0 < t < T, then 7 (f) = 0 for all f > 0, and so the service 
curve is zero. 

A better bound is obtained by differing the lower bounding of /i m by the time- 
invariant operator Cj after having used the idempotency property in the computation 
of the sub-additive closure of the right hand side of (4.34), via Corollary 4.2.1. 
Indeed, this corollary allows us to replace (4.34) by 

X = [{Hm o (n -b Ik)) o (a)- 
Now we can bound Hm below by C-y to obtain 

{Hm ° (n -b Ik)) o Hm > (C 7 ° Cpp-w) ° C 7 

= C70(/3-|-IV) ° C 7 

“ ^]Wf+W ° ^7 

= c 

70(/307-|-IV) ■ 

We obtain a better service curve than by our initial approach, where we had directly 
replaced Hm by C.y: 

/^wfc 2 (/3®7 + W^)- (4-36) 

is a better service curve than (4.35). 

For example, if (3 = I3 r^t and 7 = (3rpt', with R > R' and Ik < R'{T -P 
T'), then the service curve of the closed-loop system is the function represented on 
Figure 4.3. 
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Figure 4.3: The service curve of fh® closed-loop system with window 
flow control (bottom right), when the service curve of the open loop system 
is /3 = Pr,t (top left) and when 7 = (top right), with R > R' and 

IT < i?'(T + T'). 



Packetized greedy shaper 

Our last example in this chapter is the packetized greedy shaper introduced in Sec- 
tion 1.7.4. It amounts to computing the maximnm solntion to the problem 

X < R A Vl{x) a C„{x) 

where R is the inpnt flow, cr is a “good” function and L is a given sequence of 
cumulative packet lengths. 

We can apply Theorem 4.3.1 and next Theorem 4.2.2 to obtain 
X = Vl A Ca{R) = Vl ° Ca{R) 
which is precisely the result of Theorem 1 .7.4. 

4.4 Fixed Point Equation (Time Method) 

We conclude this chapter by another version of Theorem 4.3.1 that applies only to 
the disrete-time setting. It amonnts to compute the maximum solution T = 11(a) of 
(4.22) by iterating on time t instead of interatively applying operator II to the full 
trajectory a{t). We call this method the “time method” (see also [10]). It is valid 
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Operator 


c. 


-Fa 


St 


h(j 


Vl 


Upper semi-continuous 


yes 


no 


yes 


yes 


yes 


Lower semi-continuous 


no 


yes 


yes 


no 


no 


Isotone 


yes 


yes 


yes 


yes 


yes 


Min-plus linear 


yes 


no 


yes 


yes 


no 


Max-plus linear 


no 


yes 


yes 


no 


no 


Causal 


yes 


no 


yes (1) 


yes 


yes 


Shift-invariant 


yes 


yes 


yes 


no 


no 


Idempotent 


no (2) 


no (2) 


no (3) 


yes 


yes 



(1) (ifT > 0) 

(2) (unless cr is a ‘good’ function) 

(3) (unless T = 0) 

Table 4.1: A summary of properties of some common operators 



under stronger assumptions than the space method, as we require here that operator 
n be min-plus linear. 

Theorem 4.4.1. Let II = Cjj be a min-plus linear operator taking , with 

impulse response H € . For any fixed function a € , the problem 

X < a A jCh(x) (4.37) 

has one maximum solution, given by 

£*(0) = a(0) 

3^ (t) = a(0) A inf u) + £*(u)}. 

0<u<t — l 

Proof: Note that the existence of a maximum solution is given by Theorem 4.3.1 . 

Define X* by the recursion in the Theorem. As H G it follows easily by induction 
that is a solution to problem (4.37). Conversely, for any solution x, x{0) < 
a(0) = x*(0) andif£(u) < x*(u) forallO <u< f— 1, it follows that £(f) < x*{t) 
which shows that x* is the maximal solution. □ 



4.5 Conclusion 

This chapter has introduced min-plus and max-plus operators, and discussed their 
properties, which are summarized in Table 4.5. The central result of this chapter, 
which will be applied in the next chapters, is Theorem 4.3.1, which enables us to 
compute the maximal solution of a set of inqualities involving the iterative applica- 
tion of an upper semi-continuous operator. 





Chapter 5 

Optimal Multimedia 
Smoothing 



In this chapter we apply network calculus to smooth multimedia data over a network 
offering reservation based services, such as ATM or RS VP/IP, for which we know 
one minimal service curve. One approach to stream video is to act on the quantiza- 
tion levels at the encoder output: this is called rate control, see e.g. [22], Another 
approach is to smooth the video stream, using a smoother fed by the encoder, see 
e.g. [61, 64, 52]. In this chapter, we deal with this second approach. 

A number of smoothing algorithms have been proposed to optimize various per- 
formance metrics, such as peak bandwidth requirements, variability of transmission 
rates, number of rate changes, client buffer size [25]. With network calculus, we are 
able to compute the minimal client buffer size required given a maximal peak rate, 
or even a more complex (VBR) smoothing curve. We can also compute the minimal 
peak rate required given a given client buffer size. We will see that the scheduling 
algorithm that must be implemented to reach these bounds is not unique, and we will 
determine the full set of video transmission schedules that minimize these resources 
and achieve these optimal bounds. 



5.1 Problem Setting 

A video stream stored on the server disk is directly delivered to the client, through 
the network, as shown on Figure 5.1. At the sender side, a smoothing device reads 
the encoded video stream R{t) and sends a stream x{t) that must conform to an 
arrival curve a, which we assume to be a ‘good’ function, i.e. is sub-additive and 
such that (t( 0) =0. The simplest and most popular smoothing curve in practice is a 
constant rate curve (or equivalently, a peak rate constraint) a = Xr for some r > 0. 

We take the transmission start as origin of time: this implies that x{t) = 0 for 
t < 0. 



J.-Y. Le Boudec and P. Thiran, Network Calculus, LNCS 2050, pp. 185—206, 2001. 
© Springer- Verlag Berlin Heidelberg 2001 
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Figure 5.1: Video smoothing over a single network. 



At the receiver side, the video stream R will be played back after D units of 
times, the playback delay: the output of the decoding buffer B must therefore be 
R(t-D). 

The network offers a guaranteed service to the flow x. If y denotes the output 
flow, it is not possible, in general, to express y as a function of x. However we as- 
sume that the service guarantee can be expressed by a service curve (3. For example, 
as we have seen in Chapter 1, the IETF assumes that RSVP routers offer a rate- 
latency service curve (i of the form PL,c{t) = C[t — I/]+ = max{0, C{t — L)}. 
Another example is a network which is completely transparent to the flow (i.e. which 
does not incur any jitter to the flow nor rate limitation, even if it can introduce a fixed 
delay, which we ignore in this chapter as we can always take it into account sepa- 
rately). We speak of a null network. It offers a service curve (3{t) = So{t). 

To keep mathematical manipulations simple, we assume that the encoding buffer 
size is large enough to contain the full data stream. On the other hand, the receiver 
(decoding) buffer is a much more scarce resource. Its finite size is denoted by B. 

As the stream is pre-recorded and stored in the video server, it allows the 
smoother to prefetch and send some of the data before schedule. We suppose that the 
smoother is able to look ahead data for up to d time units ahead. This look-ahead de- 
lay can take values ranging from zero (in the most restrictive case where no prefetch- 
ing is possible) up to the length of the full stream. The sum of the look-ahead delay 
and playback delay is called the total delay, and is denoted by T: T = D -\- d. 

These constraints are described more mathematically in Section 5.2. 

We will then apply Theorem 4.3.1 to solve the following problems: 

(i) we first compute, in Section 5.3, the minimal requirements on the playback delay 
D, on the look-ahead delay d, and on the client buffer size B guaranteeing a lossless 
transmission for given smoothing and service curves a and (3. 

(ii) we then compute, in Section 5.4, all scheduling strategies at the smoother that 
will achieve transmission in the parameter setting computed in Section 5.3. We call 
the resulting scheduling “optimal smoothing”. 

(iii) in the CBR case {a = A^), for a given rate r and for a rate-latency service 
curve {(3 = (3l,c), we will obtain, in Section 5.5, closed-form expressions of the 
minimal values of D,T = D -\- d and B required for lossless smoothing. We will 
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also solve the dual problem of computing the minimal rate r needed to deliver video 
for a given playback delay D, look-ahead delay d and client buffer size B. 

We will then compare optimal smoothing with greedy shaping in Section 5.6 and 
with separate delay equalization in Section 5.7. Finally, we will repeat problems (i) 
and (iii) when intermediate caching is allowed between a backbone network and an 
access network. 



5.2 Constraints Imposed by Lossless Smoothing 

We can now formalize the constraints that completely define the smoothing problem 
illustrated on Figure 5.1). 

• Flow X e JF: As mentioned above, the chosen origin of time is such that 
x{t) =0 for t < 0, or equivalently 

x{t) < So{t). (5.1) 

• Smoothness constraint: Flow x is constrained by an arrival curve a{-). This 
means that for alH > 0 

x{t) < {x ® a){t) = C„{x){t). (5.2) 

• Playback delay constraint (no playback buffer underflow): The data is 

read out from the playback buffer after D unit of times at a rate given by 
R{t — D). This implies that y{t) > R{t — D). However we do not know 
the exact expression of y as a function of x. All we know is that the network 
guarantees a service curve j3, namely that y(t) > (a; ® The output 

flow may therefore be as low as {x ® and hence we can replace y in 

the previous inequality to obtain (a; ® /3)(f) > R{t — D). Using Rule 14 in 
Theorem 3.1.12, we can recast this latter inequality as 

x(t)>(R(Z)l3)(t-D)=Vp(R)(t-D) (5.3) 

for all t >0. 

• Playback buffer constraint (no playback buffer overflow): The size of the 
playback buffer is limited to B, and to prevent any overflow of the buffer, we 
must impose that y{t) — R{t — D) < B for all t >0. Again, we do not know 
the exact value of y, but we know that it can be as high as x, but not higher, 
because the network is a causal system. Therefore the constraint becomes, for 
all t > 0, 

x{t) < R{t- D) + B. (5.4) 

• Look-ahead delay constraint: We suppose that the encoder can prefetch data 
from the server up to d time units ahead, which translates in the following 
inequality: 



x{t) < R{t + d). 



(5.5) 
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5.3 Minimal Requirements on Delays and Playback 
Buffer 

Inequalities (5.1) to (5.5) can be recast as two sets of inequalities as follows: 

x{t) < So{t) A R{t + d) A {R{t — D) + B} A C„{x){t) (5.6) 

x(t) > (R(Z)l3)(t- D). (5.7) 

There is a solution x to the smoothing problem if and only if it simultaneously 
verifies (5.6) and (5.7). This is equivalent to requiring that the maximal solution of 
(5.6) is larger than the right hand side of (5.7) for all t. 

Let us first compute the maximal solution of (5.6). Inequality (5.6) has the form 

X < a A Ca-(x) (5.8) 

where 

a{t) = So{t) A R{t + d)A {R{t - D) + B}. (5.9) 

We can thus apply Theorem 4.3.1 to compute the unique maximal solution of (5.8), 
which is a;max = C^ia) = a ^ a because cr is a ‘good’ function. Replacing a by its 
expression in (5.9), we compute that the maximal solution of (5.6) is 

= a(t) A {{(T ® R){t + d)} A {{(T ® R){t - D) + B} . (5.10) 

We are now able to compute the smallest values of the playback delay D, of 
the total delay T and of the playback buffer B ensuring the existence of a solution 
to the smoothing problem, thanks to following theorem. The requirement on d for 
reaching the smallest value of D is therefore d = T — D. 

Theorem 5.3.1 (Requirements for optimal smoothing). The smallest values of 
D, T and B ensuring a lossless smoothing to a ‘good’ curve a through a network 
offering a service curve (3 are 

h(R, (P ® a)) = inf {f > 0 : (R(Z) (P ^ a))(-t) < 0} (5.11) 
h{{R(Z)R),{l3(»a)) (5.12) 

inf {t >0 : {{R (Z) R) (Z) {(3 ® a))(-t) < 0} 
v{{R 0 R), {(3 0 a)) = {{R Z) R) Z) (l3 Z) cr))(0). (5.13) 

where h and v denote respectively the horizontal and vertical distances given by 
Definition 3.1.15. 

Proof: The set of inequalities (5.6) and (5.7) has a solution if, and only if, the 

maximal solution of (5.6) is larger or equal to the right hand side of (5.7) at all 
times. This amounts to impose that for alH € IR 



Hrain — 
T • — 

min — 
^min — 
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{R(Z)l3){t- D) 


< 


0 


(R 0 I3)(t -D)-{(t® R)(t + d) 


< 


0 


(R 0 I3)(t -D)-{(t® R)(t - D) 


< 


B. 


Using the deconvolution operator and its properties, the 
be recast as 


latter 


three inequalities can 


(i?0 (/3 0cr))(-U) < 


0 




((i? 0 i?) 0 (/3 0 cr)) (-T) < 


0 




((i?0i?)0(/3 0cr))(O) < 


B. 





The minimal values of D,T and B satisfying these three inequalities are given by 
(5.11), (5.12) and (5.13). These three inequalities are therefore the necessary and 
sufficient conditions ensuring the existence of a solution to the smoothing problem. 

□ 



5.4 Optimal Smoothing Strategies 

An optimal smoothing strategy is a solution x{t) to the lossless smoothing problem 
where D,T = D + d and B take their minimal value given by Theorem 5.3.1. The 
previous section shows that there exists at least one optimal solution, namely (5.10). 
It is however not the only one, as we will see in this section. 

5.4.1 Maximal Solution 

The maximal solution (5.10) requires only the evaluation of an infimum at time t 
over the past values of R and over the future values of R up to time t + 4 min, with 
dniin = Tyain ~ Dmin- Of course, we need the knowledge of the traffic trace R{t) to 
dimension T>min, dmin and Bmin- However, once we have these values, we do not 
need the full stream for the computation of the smoothed input to the network. 

5.4.2 Minimal Solution 

To compute the minimal solution, we reformulate the lossless smoothing problem 
slightly differently. Because of Rule 14 of Theorem 3. 1 . 12, an inequality equivalent 
to (5.2) is 

x{t) > {x (2) (r){t) = Va{x){t). (5.14) 

We use this equivalence to replace the set of inequalities (5.6) and (5.7) by the 
equivalent set 



x{t) < So{t) A R{t + d) A {R{t — D) + B} 

(5.15) 

x(t) > (R(Z)l3)(t- D)\/V^(x)(t). (5.16) 
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One can then apply Theorem 4.3.2 to compute the minimal solution of (5.16), which 
is Xrain = L>^{h) = 60(7 where h{t) = 5o{t) A R{t + d) A {R{t — D) + B} , because 
(T is a ‘good’ function. Eliminating b from these expressions, we compute that the 
minimal solution is 



x^in{t) = {R<Z){P<»(T)){t-D), (5.17) 

and compute the constraints on d, D and B ensuring that it verifies (5.15): one 
would get the very same values of 79 min, Tmin and Bmin given by (5.1 1) (5.12) and 
(5.13). 

It does achieve the values of 79min and Bmin given by (5.11) and (5.13), but 
requires nevertheless the evaluation, at time t, of a supremum over all values of 
R up to the end of the trace, contrary to the maximal solution (5.10). Min-plus 
deconvolution can however be represented in the time inverted domain by a min-plus 
convolution, as we have seen in Section 3.1.10. As the duration of the pre-recorded 
stream is usually known, the complexity of computing a min-plus deconvolution can 
thus be reduced to that of computing a convolution. 

5.4.3 Set of Optimal Solutions 

Any function x G R such that 



a^min <X <X 



max 



and 

X <X ® <7 

is therefore also a solution to the lossless smoothing problem, for the same minimal 
values of the playback delay, look-ahead delay and client buffer size. This gives 
the set of all solutions. A particular solution among these can be selected to further 
minimize another metric, such as the ones discussed in [25], e.g. number of rate 
changes or rate variability. 

The top of Figure 5.2 shows, for a synthetic trace R{t), the maximal solution 
(5.10) for a CBR smoothing curve a(t) = \r{t) and a service curve fl{t) = So{t), 
whereas the bottom shows the minimal solution (5.17). Figure 5.3 shows the same 
solutions on a single plot, for the MPEG trace R{t) of the top of Figure 1.2.4 rep- 
resenting the number of packet arrivals per time slot of 40 ms corresponding to a 
MPEG-2 encoded video when the packet size is 416 bytes for each packet. 

An example of VBR smoothing on the same MPEG trace is shown on Figure 5 .4, 
with a smoothing curve derived from the T-SPEC field, which is given by cr = 
7p,m a 'Yr,b, where M is the maximum packet size (here M = 416 Bytes), P 
the peak rate, r the sustainable rate and b the burst tolerance. Flere we roughly 
have P = 560 kBytes/sec, r = 330 kBytes/sec and b = 140 kBytes The service 
curve is a rate-latency curve Pl,c with L = 1 second and r = 370 kBytes/sec. 
The two traces have the same envelope, thus the same minimum buffer requirement 
(here, 928kBytes). However the second trace has its bursts later, thus, has a smaller 
minimum playback delay (D 2 = 2.05s versus Di = 2.81s). 
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Figure 5.2; In bold, the maximal solution (top figure) and minimal solution (bottom 
figure) to the CBR smoothing problem with a null network. 

5.5 Optimal Constant Rate Smoothing 

Let us compute the above values in the case of a constant rate (CBR) smoothing 
curve a{t) = Xr{t) = N (with t > 0) and a rate-latency service curve of the 
network /3(f) = = C[t — L]~^. We assume that r < C, the less interesting 

case where r > C being handled similarly. We will often use the decomposition of 
a rate-latency function as the min-plus convolution of a pure delay function, with a 
constant rate function: /3 l,c = ® Ac. We will also use the following lemma. 
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Figure 5.3; In bold, the maximal and minimal solutions to the CBR smoothing prob- 
lem of an MPEG trace with a null network. A frame is generated every 40 msec. 
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Figure 5.4: Two MPEG traces with the same arrival curve (left). The corresponding 
playback delays Di and D 2 are the horizontal deviations between the cumulative 
flows R{t) and function a (g) (3 (right). 
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Lemma 5.5.1. If f e 

M/,/3l.c) =i+^(/0Ac)(O). (5.18) 



Proof: As f{t) = 0 for t <0 and as /3 ^ c = 0 Ac, we can write for any f > 0 

if C) (3L,c){-t) = sup{/(u -f) - (<5 l 0 Ac)(m)} 

ii>0 

= sup{/(u -f)- Xc{u - L)} 

u>0 

= sup{f{v) - Xc{v +t - L)} 

v>—t 

= sup{f{v) - Xc{v + t - L)} 

v>0 

= sup{f{v) - Xc{v)} - C{t - L) 

v>0 

= {f<ZXcm-Ct + CL, 

from which we deduce the smallest value of t making the left-hand side of this 
equation non-positive is given by (5 . 1 8). □ 

In the particular CBR case, the optimal values (5.11), (5.12) and (5.13) become 
the following ones. 

Theorem 5.5.1 (Requirements for CBR optimal smoothing). If a = Xr and 

(3 = I3l,c with r < C, the smallest values of D, ofT and of B are 



^min 


= L-f -(i?0A^)(O) 
r 


(5.19) 


T • 

min 


= L+-{{R(Z)R)(Z)Xr){0) 
r 


(5.20) 


^min 


= ((i?0i?) 0A^))(L) < rTmin. 


(5.21) 



Proof: To establish (5.19) and (5.20), we note that R and {R (d R) G R- Since 

r <C 

/3 0 (T = (3l,C ^ Xr = Sl ^ Xc ^ Xr = Sl ^ Xr = Pl,t 

SO that we can apply Lemma 5.5.1 with f = R and f = (R<Z) R), respectively. 

To establish (5.21), we develop (5.13) as follows 
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((i?0i?)0(/3®cr))(O) = 



< 



((i?0i?)0(<5L0A^))(O) 
sup{(i? 0 R){u) — Xr{u — L)} 

u>0 

((i?0i?) 0A^)(L) 

sup{(i? 0 R){u) — Xr{u — L)} 

u>L 

sup{(i? 0 R){u) — Xr{u)} + rL 

u>L 

sup{(i? 0 R){u) — Xr{u)} + rL 

u>0 

{{R 0 i?) 0 A^)(0) +rL = rTmi, 



□ 

This theorem provides the minimal values of playback delay D min and buffer 
Bniin, as well as the minimal look-ahead delay dmin = Tmin — ^^min for a given 
constant smoothing rate r < C and a given rate-latency service curve (i l,c- We can 
also solve the dual problem, namely compute for given values of playback delay 
D, of the look-ahead delay d, of the playback buffer B and for a given rate-latency 
service curve the minimal rate rmin which must be reserved on the network. 

Theorem 5.5.2 (Optimal CBR smoothing rate). If a = X^ and (3 = I3 l,c with 
r < C, the smallest value ofr, given D > L, d and B > {R (2) R){L), is 



Tmin 



sup 

i>0 



m 

t D — L 




(R0R)(t) ] 
t D d — L j 



V sup 
t>o 



f^ {R(Z)R){t + L)-B 



}■ 



(5.22) 



Proof: Let us first note that because of (5.19), there is no solution if < L. On 

the other hand, if > L, then (5.19) implies that the rate r must be such that for all 

f > 0 

D > L — {R{f) — rf) 
r 

or equivalently r > R{t) l(t + D — L). The latter being true for all f > 0, we must 
have r > supj>o{J?(f)/(f + D — L)}. Repeating the same argument with (5.20) 
and (5.21), we obtain the minimal rate (5.22). □ 

In the particular case where L = 0 and r < C the network is completely trans- 
parent to the flow, and can be considered as a null network: can replace fl{t) by 
So{t). The values (5.19), (5.20) and (5.21) become, respectively. 



^min 


= -(i?0A^)(O) 
r 


(5.23) 


T • 

min 


= -((i?0i?) 0 A^)(0) 
r 


(5.24) 


^min 


= ((i?0i?) 0A^))(O) =rTmi„. 


(5.25) 
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It is interesting to compute these values on a real video trace, such as the hrst 
trace on top of Figure 1.2.4. Since Bmin is directly proportional to Tmin because 
of (5.25), we show only the graphs of the values of Dmin and dmin = Tmin — 
Dyain, as a function of the CBR smoothing rate r on Figure 5.5. We observe three 
qualitative ranges of rates: (i) the very low ones where the playback delay is very 
large, and where look-ahead does not help in reducing it; (ii) a middle range where 
the playback delay can be kept quite small, thanks to the use of look-ahead and (iii) 
the high rates above the peak rate of the stream, which do not require any playback 
nor lookahead of the stream. These three regions can be found on every MPEG 
trace [71], and depend on the location of the large burst in the trace. If it comes 
sufficiently late, then the use of look-ahead can become quite useful in keeping the 
playback delay small. 





Figure 5.5: Minimum playback delay T>min and corresponding look-ahead delay 
dniin for a constant rate smoothing r of the MPEG-2 video trace shown on top of 
Figure 1.2.4. 



5.6 Optimal Smoothing versus Greedy Shaping 

An interesting question is to compare the requirements on D and B, due to the 
scheduling obtained in Section 5.4, which are minimal, with those that a simpler 
scheduling, namely the greedy shaper of Section 1.5, would create. As cr is a ‘good’ 
function, the solution of a greedy shaper is 



^shaper(f) — (o' ® 



(5.26) 
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To be a solution for the smoothing problem, it must satisfy all constraints listed 
in Section 5.2. It already satisfies (5.1), (5.2) and (5.5). Enforcing (5.3) is equivalent 
to impose that for alH € IR 



(R(Z)l3){t-D) < (cr®i?)(f), 



which can be recast as 



((i?0i?) 0 (/3®cr)) (-£))< 0. (5.27) 

This implies that the minimal playback delay needed for a smoothing using a greedy 
shaping algorithm is equal to the minimal total delay Tmin, the sum of the playback 
and lookahead delays, for the optimal smoothing algorithm. It means that the only 
way an optimal smoother allows to decrease the playback delay is its ability to look 
ahead and send data in advance. If this look-ahead is not possible (d = 0) as for 
example for a live video transmission, the playback delay is the same for the greedy 
shaper and the optimal smoother. 

The last constraint that must be verified is (5.4), which is equivalent to impose 
that for all f e IR 

(a 0 R)(t) < R(t - D) + B, 

which can be recast as 

((R^a)(Z)R)(D) <B. (5.28) 

Consequently, the minimal requirements on the playback delay and buffer using 
a greedy shaper instead of an optimal smoother are given by the following theorem. 

Theorem 5.6.1 (Requirements for greedy shaper). If a is a ‘ good’ function, then 
the smallest values of D and B for lossless smoothing of flow Rby a greedy shaper 
are 

^shaper ~ ^min = h{{R 0 i?), (/) 0 O')) (5.29) 

^shaper ~ ^ shaper) ^ shaper)]- (5.30) 

Proof: The expressions of Hgj^gpgj. and ^shaper follow immediately from (5.27) 

and (5.28). The only point that remains to be shown is that S shaper ^ shaper)’ 
which we do by picking s = u in the inf below: 

f^shaper “ 



< 



(i? 0 (i? 0 (t)) (dghaper) 

®>0 {o<^<l"-fz3.hape, ^shaper “ «)} “ -R(w)| 

®>0 -^shaper ~ u) ~ 

shaper ) • 

□ 
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Consequently, a greedy shaper does not minimize, in general, the playback 
buffer requirements, although it does minimize the playback delay when look-ahead 
is possible. Figure 5.6 shows the maximal solution a; max of the optimal shaper (top) 
and the solution a;gjjaper of the greedy shaper (bottom) when the shaping curve is a 
one leaky bucket affine curve a = 'Yr,b, when the look-ahead delay d = 0 (no look 
ahead possible) and for a null network (/3 = <5o)- In this case the playback delays 
are identical, but not the playback buffers. 




(a) Optimal smoothing solution, with 
playback buffer requirements 




(a) Greedy shaper solution, with 
playback buffer requirements 



Figure 5.6: In bold, the maximal solution (top hgure) and minimal solution (bottom 
figure) to the smoothing problem with a null network, no look-ahead and an affine 
smoothing curve a = jr,b ■ 
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Another example is shown on Figure 5.7 for the MPEG-2 video trace shown 
on top of Figure 1.2.4. Flere the solution of the optimal smoother is the minimal 
solution Xniin- 





Figure 5.7: Example of optimal shaping versus optimal smoothing for the MPEG-2 
video trace shown on top of Figure 1 .2.4. The example is for a null network and 
a smoothing curve a = A 'Yr,b with M = 416 bytes, P = 600 kBytes/sec, 
r = 300 kBytes/sec and 6 = 80 kBytes. The figure shows the optimal shaper [resp. 
smoother] output and the original signal (video trace), shifted by the required play- 
back delay. The playback delay is 2.76 sec for optimal shaping (top) and 1.92 sec 
for optimal smoothing (bottom). 



There is however one case where a greedy shaper does minimize the playback 
buffer: a constant rate smoothing (a = Xr) over a null network (/3 = <5o). Indeed, in 
this case, (5.25) becomes 



lin — fTmin — , 
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and therefore Sg^aper ~ Anin- Consequently, if no look-ahead is possible and if 
the network is transparent to the flow, greedy shaping is an optimal CBR smoothing 
strategy. 

5.7 Comparison with Delay Equalization 

A common method to implement a decoder is to first remove any delay jitter caused 
by the network, by delaying the arriving data in a delay equalization buffer, be- 
fore using the playback buffer to compensate for fluctuations due to pre-fetching. 
Figure 5.8 shows such a system. If the delay equalization buffer is properly config- 
ured, ifs combinafion wifh the guaranteed service network results into a fixed delay 
nefwork, which, from fhe viewpoinf we lake in Ihis chapler, is equivalent to a null 
network. Compared to the original scenario in Figure 5.1, there are now two separate 
buffers for delay equalization and for compensation of prefetching. We would like 
to understand the impact of this separation on the minimum playback delay D i„in . 




Figure 5.8: Delay equalization at the receiver. 



The delay equalization buffer operates by delaying the first bit of data by an 
initial delay Dgg , equal to the worst case delay through the network. We assume that 
the network offers a rate-latency service curve (i ^ c. Since the flow x is constainted 
by the arrival curve a which is assumed to be a ‘good’ function, we know from 
Theorem 1 .4.4, that the worst-case delay is 

Deq = h{a,(3L,c)- 

On the other hand, the additional part of the playback delay to compensate for 
fluctuations due to pre-fetching, denoted by Dp/, is given by (5.1 1) with (3 replaced 

by (5o: 

Dpf = h{R, So ^ a) = h{R, a). 

The sum of these two delays is, in general, larger than the optimal playback de- 
lay (without a separation between equalization and compensation for prefetching), 
^min, given by (5.11): 



^min — (^L,C ^7") . 



200 



CHAPTER 5. OPTIMAL MULTIMEDIA SMOOTHING 



Consider the example of Figure 5.9, where a = ‘jr,b with r < C . Then one easily 
computes the three delays Dmin, Dgq and Dpf, knowing that 

I3l,c ® cr = ® Ac ® 'Yr,b = Sl^ (Ac A jr,b) 

= {Sl ® Ac) A {Sl ® Jr,b) = (iL,C A {5l ® ^r,b)- 

One clearly has T>min < Dgq + Dpf: separate delay equalization gives indeed a 
larger overall playback delay. In fact, looking carefully at the figure (or working out 
the computations), we can observe that the combination of delay equalization and 
compensation for prefetching in a single buffer accounts for the busrtiness of the 
(optimally) smoothed flow only once. This is another instance of the “pay bursts 
only once” phenomenon, which we have already met in Section 1.4.3. 




Figure 5.9: Delays D^ain, Dgg and Dpf for a rate-latency service curve /3 l,c and an 
affine smoothing curve a = ^r,b ■ 



We must however make - once again - an exception for a constant rate smooth- 
ing. Indeed, if cr = A^ (with r < C), then Dpf is given by (5.23) and Dmin by 
(5.19), so that 



^eq — h(.^r : Pl,c') — ^ 

Dpf = i(i?0A^)(O) 

^^min = L -\ (i?0Ar)(O) 

r 



and therefore Dmin = Dgg + Dpf. In the CBR case, separate delay equalization is 
thus able to attain the optimal playback delay. 
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5.8 Lossless Smoothing over Two Networks 

We now consider the more complex setting where two networks separate the video 
server from the client; the first one is a backbone network, offering a service curve 
Pi to the flow, and the second one is a local access network, offering a service curve 
P 2 to the flow, as shown on Figure 5.10. This scenario models intelligent, dynamic 
caching often done at local network head-ends. We will compute the requirements 
on D, d, B and on the buffer X of this intermediate node in Subsection 5.8.1. 
Moreover, we will see in Subsection 5.8.2 that for constant rate shaping curves 
and rate-latency service curves, the size of the client buffer B can be reduced by 
implementing a particular smoothing strategy instead of FIFO scheduling at the 
intermediate node. 
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Figure 5.10: Smoothing over two networks with a local caching node. 



Two flows need therefore to be computed: the first one x 1 (f) at the input of the 
backbone network, and the second one x 2 (f) at the input of the local access network, 
as shown on Figure 5.10. 

The constraints on both flows are now as follows: 

• Causal flow x \ : This constraint is the same as (5.1), but with x replaced by 

Xi\ 

xi(t)<5o(t), (5.31) 

• Smoothness constraint: Both flows x\ and x^ are constrained by two arrival 
curves cf\ and < 72 : 

xi(t) < (xi ® (Ti){t) (5.32) 

X2{t) < {X2 ®cr2)(f). (5.33) 

• No playback and intermediate server buffers underflow: The data is read 
out from the playback buffer after D unit of times at a rate given by R{t — 
D), which implies that 2/2 (f) > R{t — D). On the other hand, the data is 
retrieved from the intermediate server at a rate given by a; 2 (t ) , which implies 
that j/i (t) > X 2 (t). As we do not know the expressions of the outputs of each 
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network, but only a service curve /3 1 and /32 for each of them, we can replace 
yi by xi ® Pi and y 2 by X 2 ^ P 2 , and reformulate these two constraints by 

X 2 {t) < (xi ig) Pi){t) (5.34) 

X2{t) >{R(Z)p2){t- D). (5.35) 

• No playback and intermediate server buffers overflow; The size of the 
playback and cache buffers are limited to B and X, respectively, and to pre- 
vent any overflow of the buffer, we must impose that yi{t) — X 2 (t) < X and 
y 2 (t) — R{t — D) < B for all t >0. Again, we do not know the exact value 
of yi and y 2 , but we know that they are bounded by a; 1 and X 2 , respectively, 
so that the constraints becomes, for alH > 0, 

xi(t) < X 2 (t) + X (5.36) 

X2{t)<R{t-D)+B. (5.37) 

• Look-ahead delay constraint: this constraint is the same as in the single 
network case: 

x\{t) < R{t + d) . (5.38) 



5.8.1 Minimal Requirements on the Delays and Buffer Sizes for 
Two Networks 

Inequalities (5.31) to (5.38) can be recast as three sets of inequalities as follows: 

x\{t) < So{t) A R{t + d) A {ai ^ xi){t) A {x 2 {t) + X) (5.39) 

X 2 {t) < {R{t - D) + B} A {Pi ^ xi){t) A {(72 ^ X 2 ){t) (5.40) 

X2{t) > {R(Z)p2){t- D). (5.41) 



We use the same technique for solving this problem sa in Section 5.3, except 
that now the dimension of the system J is 2 instead of 1 . 

With T denoting transposition, let us introduce the following notations: 



X{t) = [Xi{t) X2{t)]’^ 

d{t) = [5o{t) A R{t + d) R{t-D)+BY’ 
h{t) = [0 {R(Zp2){t-D)Y 



m 



O'! (f) <^o(f) + X 

Pl{t) (72 (f) 



With these notations, the set of inequalities (5.39), (5.40) and (5.41) can there- 
fore be recast as 



X < d A {E® x) 

X > b. 



(5.42) 

(5.43) 
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We will follow the same approach as in Section 5.3: we first compute the maximal 
solution of (5.42) and then derive the constraints on D,T (and hence d), X and B 
ensuring the existence of this solution. We apply thus Theorem 4.3.1 again, but this 
time in the two-dimensional case, to obtain an explicit formulation of the maximal 
solution of (5.42). We get 

:?max = Cs(a) = (S ® a) (5.44) 

where E is the sub-additive closure of E, which is, as we know from Section 4.2, 

E=inf|E(”H (5.45) 

nSN 



where E^°) = Dq and E^") denotes the nth self-convolution of E. Application of 
(5.45) to matrix E is straightforward, but involves a few manipulations which are 
skipped. Denoting 

a = (Ti ® (72 ® inf -b nx) (5.46) 

nSN I J 

= (Ti ® (72 ® /3l ® /3l -b X, 



we find that 



E = 



(7i A (cK-b X') 
a 



(<7i(8)(72-bA")A(Q: -b 2X) 
(72 A (a -b X) 



and therefore the two coordinates of the maximal solution of (5.42) are 



a:imax(i) = (Ti{t) A {a{t) + X} A {(Ti ^ R){t + d) A {{a^ R){t + d) + X} 
A {((7i ® (72 ® R){t - D) + B + X} 

A{{a^R){t-D) + B + 2X} (5.47) 

a: 2 max(i) = a{t) A {a ^ R){t + d) A {{(72 ^ R){t - D) + B} 

A{{a^R)it-D) + B+X}. (5.48) 



Let us mention that an alternative (and computationally simpler) approach to obtain 
(5.47) and (5.48) would have been to first compte the maximal solution of (5.40), as 
a function of xi, and next to replace X 2 in (5.39) by this latter value. 

We can now express the constraints on A, B, D and d that will ensure that a 
solution exists by requiring that (5.48) be larger than (5.41). The result is stated in 
the following theorem, whose proof is similar to that of Theorem 5.3.1. 

Theorem 5.8.1. The lossless smoothing of a flow to (sub-additive) curves a\ and 
(72, respectively, over two networks offering service curves (3\ and f )2 has a solution 
if and only if the D, T, X and B verify the following set of inequalities, with a 
defined by (5.46): 



(i?0 (a®/32)(-D) 


< 


0 


(5.49) 


((i?0i?) 0(a0/32))(-T) 


< 


0 


(5.50) 


((i?0i?) 0((72 0/32))(O) 


< 


B 


(5.51) 


((i?0i?) 0 (a 0/32)) (0) 


< 


B+X. 


(5.52) 
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5.8.2 Optimal Constant Rate Smoothing over Two Networks 

Let us compute the values of Theorem 5.8.1 in the case of two constant rate (CBR) 
smoothing curves ai = and <72 = Xr 2 ■ We assume that each network offers a 
rate-latency service curve /3, = flLi,Ci, i = 1,2. We assume that r, < Ci In this 
case the optimal values of D, T and B become the following ones, depending on 
the value of X. 

Theorem 5.8.2. Let r = ri A r 2 . Then we have the following three cases depending 
on X: 

(i) If X > rLi, then T>min, Tmin and Bmin are given by 



Rmin 


— L\ + Z/2 H — {R 0 '^r)(0) 
r 


(5.53) 


T ■ 

min 


= L\ + L 2 H — {{.R 0 R^ 0 '^r)(0) 
r 


(5.54) 


^min 


= {{R 0 -R) 0 A^J(L2) V {((R 0 -R) 0 A^)(Li + L 2 ) 


-X} 




< ((-R0-R) 0 A^)(L2). 


(5.55) 



(ii) IfO < X < rL\ then T>min, Tmin and Bmin are bounded by 

X L 

h T/2 + 0 Ai)(0) < Dmin 

r A i-i 

< Li -f L2 + A^)(0) (5.56) 

X 

h T/2 + —{{R 0 i?) 0 Ai)(0) < Tmin 

r A i -1 

< Li -f L2 + ^((i?0i?) 0 A^)(0) (5.57) 

X 

{{R 0 i?) 0 A^)(Li -I- L 2 ) - r2Ti ^ ^min 

^1 

< {{R(Z)R)(Z)\x.){L2) (5.58) 

^1 

(Hi) Let K be duration of the stream. If X = 0 < rLi then Dmin = X. 

Proof One easily verifies that = S(n+i)Li and that = Aci- Since 

Pi = Pli, C l = Sli 0 Aci, and since r = r\ A <C\, (5.46) becomes 

a = Ar 0 mf 0 Aci +nX} 

= 5li 0 inf {5nLi 0 Ar -I- nX] . (5.59) 

nSN 



(i) If Al > rL\, then for t > nL\ 

i^nLi 0 Xr){t) + nX = Xr{t — uLi) + nX = rt + n{X — rLf) > rt = Xr{t) 



whereas for 0 < t < nLi 
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(SnLi ® ^r)(t) + nX = \j.{t — nLi) + nX = nX > nrL\ > rt = Xr(t). 

Consequently, for all f > 0, a{t) > {Sli ® Xr){t). On the other hand, taking 
n = 0 in the infimum in (5.59) yields that a < Sli ^ A^. Combining these two 
inequalities, we get that 

a = Sli ^ Xr 

and hence that 

a ® /32 = ® Ar ® Sl2 ® Xr2 = Sli + L2 ® Ar = /^Li+La.r- (5.60) 

Inserting this last relation in (5.49) to (5.52), and using Lemma 5.5.1 we establish 
(5.53), (5.54) and the equality in (5.55). The inequality in (5.55) is obtained by 
noticing that r 2 > r and that 



{{R(Z)R)(Z)Xr){Li+L2)-X = 



< 

< 



sup{(i? 0 R){u + Li + L 2 ) - ru} — X 

u>0 

sup {{R 0 R){v + L 2 ) - r(v - Li)} - X 

v>Li 

sup{(i? 0 R){v + L 2 ) - rv} + {rLi - X) 

v>0 



((i? 0 i?) 0 A^)(L 2 ). 



(ii) If 0 < X < rLi, the computation of a does not provide a rate-latency curve 

anymore, but a function that can be bounded below and above by the two following 
rate-latency curves: !5li,x/Li < < l^xir,xiLi ■ Therefore, replacing (5.60) by 

5Li+L2 ® A^ < a 0 /)2 < ^x+l2 ® A^, 

L\ r ' Li 

and applying Lemma 5.5.1 to both bounding rate-latency curves (iLi,xiLi and 
(ixir,xiLi > we get respectively the lower and upper bounds (5.56) to (5.58). 

(iii) If Tl = 0 and rL\ > 0 then (5.59) yields that a{t) = 0 for all f > 0. In this 

case (5.49) becomes sup„>o{J?(m — D)} < 0. This is possible only if D is equal to 
the duration of the stream. □ 

It is interesting to examine these results for two particular values of X. 

The first one is X = 00 . If the intermediate server is a greedy shaper whose 
output is 2:2 (f) = (<T 2 0yi)(f), one could have applied Theorem5. 5.1 with (72 = A^ 
and /) = /3i 0 (T 2 0 /)2 = Sli+L 2 ® A ^2 = Pli+L 2 ,t 2 to find out that D and T are 
still given by (5.53) and (5.54) but that B = {{R<Z) R) <Z)Xr){Li 4-1/2) is larger than 
(5.55). Using the caching scheduling (5.48) instead of a greedy shaping one allows 
therefore to decrease the playback buffer size, but not the delays. The buffer X of 
the intermediate node does not need to be infinite, but can be limited to rL 1 . 

The second one is Af = 0. Then whatever the rate r > 0, ifLi > 0, the 
playback delay is the length of the stream, which makes streaming impossible in 
practice. When Li = L 2 = 0 however (in which case we have two null networks) 
X = rLi = 0 is the optimal intermediate node buffer allocation. This was shown 
in [61](Lemma 5.3) using another approach. We see that when L 1 > 0, this is no 
longer the case. 
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5.9 Bibliographic Notes 

The first application of network calculus to optimal smoohting is found in [48], for 
an unlimited value of the look-ahead delay. The minimal solution (5.17) is shown 
to be an optimal smoothing scheme. The computation of the minimum look-ahead 
delay, and of the maximal solution, is done in [71]. Network calculus allows to re- 
trieve some results found using other methods, such as the optimal buffer allocation 
of the intermdiate node for two null networks computed in [61]. 

It also allows to extend these results, by computing the full set of optimal sched- 
ules and by taking into account non null networks, as well as by using more complex 
shaping curves a than constant rate service curves. For example, with the Resource 
Reservation Protocol (RSVP), a is derived from the T-SPEC held in messages used 
for setting up the reservation, and is given by cr = Jp,m /\Tr,b, where M is the max- 
imum packet size, P the peak rate, r the sustainable rate and b the burst tolerance, 
as we have seen in Section 1.4.3. 

The optimal T-SPEC held is computed in [48]. More precisely, the following 
problem is solved. As assumed by the Intserv model, every node offers a service 
of the form Pl,c for some latency L and rate C, with the latency parameter L 
depending on the rate C according to L = ^ -|- E>o. The constants Co and Do 
depends on the route taken by the how throughout the network. Destinations choose 
a target admissible network delay Dnet- The choice of a specihc service curve fl 
(or equivalently, of a rate parameter C) is done during the reservation phase and 
cannot be known exactly in advance. The algorithm developed in [48] computes 
the admissible choices of a = A 7^,?) and of Dnet in order to guarantee that 

the reservation that will subsequently be performed ensures a playback delay not 
exceeding a given value D. 




Chapter 6 



FIFO Systems and Aggregate 
Scheduling 

6.1 Introduction 

Aggregate scheduling arises naturally in many case. Let us just mention here the 
differentiated services framework (Section 2.4) and high speed switches with optical 
switching matrix and FIFO outputs. 

The state of the art for aggregate multiplexing is surprisingly poor. We sum- 
marize this in the form of a conjecture in Section 6.3.1. In this chapter, we give a 
panorama of results, many of them are new. In Section 6.2, we give bounds for a 
service element in isolation that serves several flows in an aggregate. The particular 
service discipline is not known, but we assume that the aggregate receives a strict 
service curve guarantee. Then we are able to bound the service received by any sub- 
flow inside the aggregate. The bound corresponds actually to the worst case where 
a sub-flow receives the lowest priority. 

Then in Section 6.3 we consider a global network using aggregate multiplex- 
ing; given constraints at the inputs of the network, can we obtain some bounds for 
backlog and delay ? An iterative application of Section 6.2 can be used as a solution; 
however, as illustrated in Section 2.4, it does not given very good bounds. We do not 
have a general answer, not even a general method. However, for the particular case 
of a unidirectional ring, we are able to obtain a closed form bound, which shows 
that the ring is always stable (for utilization factors less than 1). 

Then we consider FIFO multiplexing. We expect to find better bounds, and we 
do. However, we find the same pattern as above. First, in Section 6.4, we find some 
bounds for FIFO network elements. If one of the flows is leaky-bucket constrained, 
we have simple closed form expressions, which improve on Section 6.2. However, 
their iterative application in a network setting does not give the best known bounds. 
Here too, we know little about global network effects. In Section 6.5, we analyze 
a particular case, where strong rate limitations at all sources have the effect of pro- 
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viding simple, closed form bounds (but the proof of such a result is extraordinarily 
complicate). 

6.2 General Bounds for Aggregate Scheduling 

Consider two flows multiplexed into the same service curve element; we are inter- 
ested in the service received by one of the two flows, say flow 1. If we have no 
information about arbitration between the two flows, then a worst case is obtained 
by assuming that flow 2 receives low priority. 

Theorem 6.2.1 (Blind multiplexing). Consider a node serving two flows, 1 and 
2, with some unknown arbitration between the two flows. Assume that the node 
guarantees a strict service curve (3 to the aggregate of the two flows. Assume that 
flow 2 is a 2 - smooth. Define j3i{f) := [I3{f)—a2{f)\^ ■ If f3\ is wide-sense increasing, 
then it is a service curve for flow 1. 

Proof: The proof is a straightforward extension of that of Proposition 1.3.4. □ 

We have seen an example in Section 1.3.2: if fl{t) = Ct (constant rate server or 
GPS node) and 0:2 = ^r,b (constraint by one leaky bucket) then the service curve 
for flow 1 is the rate-latency service curve with rate C — r and latency . 

Corollary 6.2.1 (Non preemptive priority node). Consider a node serving two 
flows, H and L, with non-preemptive priority given to flow H. Assume that the 
node guarantees a strict service curve (3 to the aggregate of the two flows. Then the 
high priority flow is guaranteed a service curve f3H{t) = [f3{t) — where 

is the maximum packet size for the low priority flow. 

If in addition the high priority flow is an-smooth, then define (3 l by (3 1,(1) = 
[l3(t) — If /3l is wide-sense increasing, then it is a service curve for the low 

priority flow. 

Proof: The first part is an immediate consequence of Theorem 6.2.1. The second 

part is proven in the same way as Proposition 1.3.4. □ 

If we relax the assumption that the service curve property is strict, then the above 
results do not hold. A counter-example can be built as follows. All packets have the 
same size, 1 data unit, and input flows have a peak rate equal to 1 . Flow 1 sends 
one packet at time 0, and then stops. The node delays this packet forever. With an 
obvious notation, we have, for f > 0: 

Ri(t) = min(f, 1) and R'l(t) = 0 

Flow 2 sends one packet every time unit, starting at time t = 1. The output is a 
continuous stream of packets, one per time unit, starting from time 1. Thus 

Rflt) = {t- 1)+ and R'2(t) = i? 2 (i) 



The aggregate flows are, for f > 0: 
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R(t) = t and R'{t) = {t — 1)* 

In other words, the node offers to the aggregate flow a service curve S i . Obviously, 
Theorem 6.2.1 does not apply to flow 1: if it would, flow 1 would receive a service 
curve (<5i — Ai )■'■ = <5i , which is not true since it receives 0 service. We can interpret 
this example in the light of Section 1.4.4 on Page 36: if the service curve property 
would be strict, then we could bound the duration of the busy period, which would 
give a minimum service guarantee to low priority traffic. We do not have such a 
bound on this example. In Section 6.4 we see that if we assume FIFO scheduling, 
then we do have a service curve guarantee. 



6.3 Stability of a Network with Aggregate Scheduling 

6.3.1 The Open Issue of Stability 

In this section we consider the following global problem: Given a network with 
aggregate scheduling and arrival curve constraints at the input, can we find good 
bounds for delay and backlog ? As it turns out today, this problem is still open. 

A first attempt at solving the problem is based on the previous section and the 
bounding method in Section 2.4.2. We have seen in Section 2.4.2 that finite bounds 
can be found with this method for small utilization factors. When the bounds are 
infinite, we do not know in general whether the delays can become arbitrarily large 
or not. In[3], the author exhibits networks of constant rate, FIFO queues, with leaky 
bucket constraint sources, and utilization factors less than 1 (a “critical” network), 
where the delay for some sources goes to infinity. However, the result is based on 
an incomplete proof. At the time of writing, it appears thus that the following con- 
jecture is still open. 

Assumption and Notation 

• Consider a network with a fixed number I of flows, following fixed paths. 
A network node is modeled as a collection of output buffers, with no con- 
tention other than at the output buffers. Every buffer is associated with one 
unidirectional link that it feeds. 

• Flow i is constrained by one leaky bucket of rate p, and burstiness cr, at the 
input. 

• Inside the network, flows are treated as an aggregate by the network; within an 
aggregate, packets are served according to some unspecified arbitration pol- 
icy. The only constraint we impose on the scheduling is that the aggregate of 
all flows receives a service curve at node m equal to the rate-latency function 
with rate and latency ^ . 

This implies a form of work conservation. Note that we do not require that 
the service curve property be strict. 
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bm accounts for the latency on the link that exits node to; it also account for 
delays due to the scheduler at node to. 

• We write i 9 to to express that node to is on the route of flow i. For any node 
TO, define = J2i3m Pi and let v = minm(rm - 

• The bit rate of the link feeding node to is Cm < +oo. 

We say that such a network is stable if the backlog at any node remains bounded. 

Conjecture 6.3.1 (Stability of sub-critical session oriented network). We conjec- 
ture that, ifv>Q (the network is “subcritical”), any network satisfying the above 
assumptions is stable. 

[32] shows that, for constant rate servers, the problem can be reduced to the 
case where every source i sends a burst 6, instantly at time 0, then sends at a rate 
limited by p, . In the next section we show that the conjecture is true for the case of 
a unidirectional ring. 



6.3.2 The Ring is Stable 

The result was initially obtained in [69] for the case of a ring of constant rate servers, 
with all servers having the same rate. We give here a more general, but simpler form. 

Assumption and Notation We take the same assumptions as in Section 6.3.1 and 
assume in addition that the network topology is a unidirectional ring. More pre- 
cisely: 

• The network is a unidirectional ring of M nodes, labelled 1, ..., M. We use 
the notation to©A: = (to-|-A: — 1) mod M 1 and mQk = {m — k — 1) 
mod M -b 1, so that the successor of node to on the ring is node to © 1 and 
its predecessor is node to © 1. 

• The route of flow i is (0, Tfirst, Tfirst © 1, ...,i.first © (hi — 1)) where 0 is a 
virtual node representing the source of flow i, T first is the first hop of flow i, 
and hi is the number of hops of flow i. At its last hop, flow i exits the network. 
We assume that a flow does not wrap, namely, hi < M. If hi = M, then the 
flow goes around the all ring, exiting at the same node it has entered. 

• Let b = '^m reflect the total latency of the ring. 

• For any node to let = J^i^m ^i- 

Let (Tmax = max^^i and (J = J2i Note that cr^ax < cr < McTmax- 

• Define v = min^ (r^ — ) . 
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• Let = E*.flrst=m Pi p = [C^ ~ E reflects 

the sum of the peak rate of transit links and the rates of fresh sources, minus 
the rate guaranteed to the aggregate of microflows. We expect high values of 
E to give higher bounds. 

Theorem 6.3.1. If V > Q (the network is “sub-critical”) then the backlog at any 
node of the unidirectional ring is bounded by 

M — {IH(JYaa.yi + 6) + (T + 6 



Proof: The proof relies on the concept of chain of busy periods, combined with 

the time stopping method in Section 2.4.2. 

For a node m and a flow i, define as the cumulative amount of data of 

flow i at the output of node to. For to = 0, this defines the input function. Also 
define 

x^{f) = Y,{R^i{f)-RT{t)) (6.1) 

i3m 

thus Xm (t) is the total amount of data that is present in the network at time t and 
will go through node to at some time > t. 

We also define the backlog at node to by 

i3m,i.hrst^m «.first=m i3m 

Now obviously, for all time t and node to: 

qm{t) < Xm{t) (6.2) 

and 

M 

Xm(t) <'^qn(t) (6.3) 

n— 1 

(Step 1) Assume that a finite bound X exists. Consider a time t and a node to 
that achieves the bound: Xm{t) = X. We fix to and apply Lemma 6.3.1 to all nodes 
n. Call s„ the time called s in the lemma. Since Xn{sn) < X, it follows from the 
first formula in the lemma that 

(f - S„)l/ < MfTmax + (6.4) 

By combining this with the second formula in the lemma we obtain 

^ MOmax Fb (n) 

Qnii) < E h + cr^ 

Now we apply Equation (6.3) and note that ~ from which we derive 
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X <M^(Mamax + b) + a + b (6.5) 

(Step 2) By applying the same reasoning as in Section 2.4.2, we find that Equa- 
tion (6.5) is always true. The theorem follows from Equation (6.2). □ 

Lemma 6.3.1. For any nodes m, n (possibly with m = n), and for any time t there 
is some s such that 

f ^mil) ^ tCYi(tf) (t S^V -f fifcTmax “f b 
I qn{t) <{t- s)p + bn + 4”^ 

with 4”^ = E*.flrst=« 'hi- 
proof: By definition of the service curve property at node m, there is some s i 

such that 

i3m i3m,i.hrst^m «.first=m 

which we can rewrite as 

E! ^Ti^) ^ (®l) + t’m{t — Sl) — bm 

i3m i3m 

with 

E [R^st) - k-Hs,)) 

«3m,«.first7^m 

Now the condition {i 9 TO,Lfirst f m} implies that flow i passes through node 
TO — 1, namely, {i B (m — 1)}. Eurthermore, each element in the summation that 
constitutes A is nonnegative. Thus 



E ^T(t) > -Xmei(si) + E E(si) - Sl) - (6.6) 

i3m i3m 

Now combining this with the definition of Xm{t) in Equation (6.1) gives: 

Xm{t) < Xmeiisi) + E “ R^Si)) -rm{t ~ Sl) + 6m 

i3m 

Erom the arrival curve property applied to all micro-flows i in the summation, we 
derive: 

Xm(t) < a;mei(si) - (r™ - si) -I- -I- b^ 

and since Vm — > u and < <Tmax by definition of v and Cmax, we have 
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^m(^) ^ "I" ^^max + 

We apply the same reasoning to node to 0 1 and time si, and so on iteratively 
until we reach node n backwards from to. We thus build a sequence of times s o = 
t, si,S 2 , ...jSj, ..., Sk such that 

^ ^m©(j+l) (^i+1 ) "b ^^max "b ^mQj (6-7) 

until we have m Q k = n. If n = TOwe reach the same node again by a com- 
plete backwards rotation and k = M. In all cases, we have k < M. By summing 
Equation (6.7) for j = 0 to A: — 1 we find the first part of the lemma. 

Now we prove the second part, s = is obtained by applying the service curve 
property to node n and time Sk-i- Apply the service curve property to node n and 
time t. Since t > su-i, we know from Proposition 1.3.2 on Page 24 that we can find 
some s' > s such fhat 

E E Rns')+rn{t-s')-b„ 

i3n i3n,i.hrst^n «.first=n 

Thus 

Qn{t)< E {R7^\t)-Rl^\.s')) + 

i3n,i.hrst^n 

«.first=n 

< (Cn -r„+ - s') + b„ + 4”^ <{t- s')p + K + 4”^ 

fhe second part of the formula follows from s < s' . □ 

Remark: A simpler, but weaker bound, is 

M— {Ma + b) + a + b 

V 

or 

(AAfTmax + 6) + Mamax + b (6.8) 



The special case in [69]: Under the assumption that all nodes are constant rate 

servers of rate equal to 1 (thus Cm = = 1 and bm is the latency of the link to), 

the following bound is found in [69] : 



Bi 



Mb + MVmax 

V 



+ b 



(6.9) 



In that case, we have /r < 1 — i/. By applying Equation (6.8), we obtain the bound 



B2 



Mjlb + -b Mi/] (Tmax 



+ b 



V 
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since 

(6.10) 

and 0 < 1 / < 1, M < M^, we have B 2 < Bi, namely, our bound is better than 
that in [69]. If there is equality in Equation (6.10) (namely, if there is a node that 
receives no transit traffic), then both bounds are equivalent when ly 0. 

6.4 Bounds for a FIFO Service Curve Element 

We give, in this section, the fundamental results for FIFO multiplexing; here, we 
assume that the node guarantees to the aggregate flow a minimum service curve. 
We find some explicit closed forms bounds for some simple cases. The results are 
valid for variable length packets as well as ATM. 

Proposition 6.4.1 (FIFO Minimum Service Curves [17]). Consider a lossless 
node serving two flows, 1 and 2, in FIFO order. Assume that packet arrivals are 
instantaneous. Assume that the node guarantees a minimum service curve (3 to the 
aggregate of the two flows. Assume that flow 2 is a 2 -smooth. Define the family of 
functions /3g by 

Plit) = [l3(t) - a2(t - 6»)] + l{i>0} 

Call Ri (t) , R[ (t) the input and output for flow 1. Then for any 6 >Q 



R'i>Ri^l3l (6.11) 

If j3l is wide-sense increasing, flow 1 is guaranteed the service curve jd] 

The assumption that packet arrivals are instantaneous means that we are either 
in a fluid system (one packet is one bit or one cell), or that the input to the node is 
packetized prior to being handled in FIFO order. 

Proof: We give the proof for continuous time and assume that flow functions are 

left-continuous. All we need to show is Equation (6.11). Call i?, the flow i input, 
R = Ri R 2 , and similarly R \ , R' the output flows. 

Fix some arbitrary parameter 9 and time t. Define 

u := supjr; : R(v) < R'(t)} 



Note that u < t and that 



R{u) < R'{t) and R{u+) > R'{t) (6.12) 

where Rr{u) = inf^>„[i?(r;)] is the limit to the right of R at u. 

(Case 1) consider the case where u = t. It follows from the above and from 
R' < R that R'i(t) = Ri(t). Thus for any 9, we have R'i(t) = Ri(t) /3g(0) 
which shows that R[ (t) > (Ri ® (dl) (t) in that case. 

(Case 2), assume now that u < t. We claim that 
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Ri{u)<R[{t) (6.13) 

Indeed, if this is not true, namely, i?i (u) > R[ (t), it follows from the first part of 
Equation (6.12) that i? 2 (tt) < R' 2 {t)- Thus some bits from flow 2 arrived after time 
u and departed by time t, whereas all bits of flow 1 arrived up to time u have not yet 
departed at time t. This contradicts our assumption that the node is FIFO and that 
packets arrive instantaneously. 

Similarly, we claim that 

{R2)r{u) > R'^it) (6.14) 

Indeed, otherwise x := R'^it) — (i? 2 )r(w) > 0 and there is some vq € (u,f] such 
that for anyu € (u,uo] wehavei? 2 (u) < R'^it) — From Equation (6.12), we can 
hnd some v\ € (u, uq] such that if u € (u, v\] then R\ (v) + R 2 {v) > R'{t) — |. It 
follows that 

RM>R'iii) + \ 

Thus we can hnd some v with Ri{v) > R[{t) whereas R 2 {v) < R^it), which 
contradicts the FIFO assumption. 

Call s a time such that R'(t) > R{s) + j3(t — s). We have R{s) < R'(t) thus 
s <u. 

(Case 2a) Assume that u <t — 9 thus also t — s > 9. From Equation (6.14) we 
derive 

R'l(t) > i?i(s) +/3(f-s) +i? 2 (s) —R' 2 {t) > i?l(s) +/3(f-s) +i? 2 (s) - {R 2 )r{u) 

Now there exist some e > 0 such that u + e < t — 9, thus (i? 2 )r(w) < R 2 {t — 9) 
and 

R[(t) > R\{s) + (3{t — s) — a 2 {t — s — 9) 

It follows from Equation (6.13) that 

> i?i(s) 

which shows that 

R'lii) > Ri{s) + (Slit - s) 

(Case 2b) Assume that u >t — 9. By Equation (6.13): 

R'l (t) > Ri (u) = Ri (u) + l3l(t — u) 



□ 

We cannot conclude from Proposition 6.4. 1 that inf ^ is a service curve. How- 
ever, we can conclude something for the output. 

Proposition 6.4.2 (Bound for Output with FIFO). Consider a lossless node serv- 
ing two flows, 1 and 2, in FIFO order. Assume that packet arrivals are instantaneous. 
Assume that the node guarantees to the aggregate of the two flows a minimum ser- 
vice curve (3. Assume that flow 2 is a 2 -smooth. Define the family of functions as in 
Proposition 6.4.1. Then the output of flow 1 is a\-smooth, with 

alif) = inf (ai 0 Id]) (t) 
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Proof: Observe first that the network calculus output bound holds even if /3 is not 

wide-sense increasing. Thus, from Proposition 6.4.1, we can conclude that (ai ® 
7) 0 /3j is an arrival curve for the output of flow 1. This is true for any 0. □ 

We can apply the last proposition and obtain the following practical result. 

Theorem 6.4.1 (Burstiness Increase due to FIFO). Consider a node serving two 
flows, 1 and 2, in FIFO order. Assume that flow 1 is constrained by one leaky bucket 
with rate r\ and burstiness h\, and flow 2 is constrained by a sub-additive arrival 
curve Q! 2 . Assume that the node guarantees to the aggregate of the two flows a rate 
latency service curve (3 r^t- Call '■= infj>o the maximum sustainable rate 

for flow 2. 

Ifri + r 2 < R, then at the output, flow 1 is constrained by one leaky bucket with 
rate r\ and burstiness h\ with 



and 

B = sup [ci! 2 (f) + rif — Rt] 
t>o 

The bound is a worst case bound. 

Proof: (Step l)Define/3g as in Proposition 6.4.1. Define B 2 = supj>o [a 2 {t) — Rt]. 

Thus B 2 is the buffer that would be required if the latency T would be 0. We hrst 
show the following 

iiO>-^-\-T then for f > 6» : (ilf) = Rt - RT - a 2 {t - 0) (6.15) 

R 

To prove this, call <p{t) the right hand-side in Equation (6. 15), namely, for t > 0 
define <f){t) = Rt — a 2 (t — 0) — RT. We have 

inf (f>{f) = inf [Rv — Q !2 (v) — RT R0] 

t>0 v>0 

From the dehnition of B 2 : 



= 61 -b ri T -b 



inf fit) = -B 2 -\-R0-RT 

t>0 

If 0 > ^ + T then f(f) > 0 for all t > 0. The rest follows from the dehnition of 

Pi 

(Step 2) We apply the second part of Proposition 6.4.1 with 0 = ^ + T. An 
arrival curve for the output of flow 1 is given hy 

Q!i = A^j ,()j 0 Pg 

We now compute af First note that obviously B < B 2 , and therefore Pl(t) = 
Rt — RT — a 2 (t — 0) for t > 0. is thus defined for f > 0 hy 
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al(t) = sup [rit + 61 + ris - Pl(s)] = rU + bi + sup [ris - Pl(s)] 

s>0 s>0 

Define ^(s) := ris — Pl{s). Obviously: 

sup [V'(s)] = ri9 

5G[O,0] 

Now from Step 1 , we have 

sup[^(s)] = sup [ris — i?s + i?T + 0 : 2(5 — 0)] 

s>9 s>9 

= sup [r\v — Rva 2 {v)\ + (ri — R)0 + RT 

v~>0 

From the definition of B, the former is equal to 

sup[V'(s)] =B + {ri- R)0 + RT = nO 

s>9 

which shows the burstiness bound in the theorem. 

(Step 3) We show that the bound is attained. There is a time a 9 such that B = 
{ct2)r{0) — {R — Ti)0. Define flow 2 to be greedy up to time 0 and stop from there 
on: 

f R 2 (t) = 02 (t) for t < 0 

\ R2{t) = {R2)r{0) iOT t > 0 

Flow 2 is 02 -smooth because 02 is sub-additive. Define flow 1 by 

( Ri (t) = rit for t < 0 
\ Ri (t) = rit + bi ioT t > 0 

Flow 1 is Xri,bi -smooth as required. Assume the server delays all bits by T at time 
0, then after time T operates with a constant rate R, until time 9 + 9, when it be- 
comes infinitely fast. Thus the server satisfies the required service curve property. 
The backlog just after time 0 is precisely B + RT. Thus all flow-2 bits that arrive 
just after time 9 are delayed hy ^ + T = 0. The output for flow 1 during the time 
interval {0 + 0,0 + 0 + t] is made of the bits that have arrived in {0,0 + t], thus there 
are rit + b^ such bits, for any t. □ 

The following corollary is an immediate consequence. 

Corollary 6.4.1 (Burstiness Increase due to FIFO). Consider a node serving two 
flows, 1 and 2, in FIFO order. Assume that flow i is constrained by one leaky bucket 
with rate r, and burstiness bi. Assume that the node guarantees to the aggregate 
of the two flows a rate latency service curve I4 r,t- If r\ + V 2 < R, then flow 1 
has a service curve equal to the rate latency function with rate R — V 2 and latency 
T + ^ and at the output, flow 1 is constrained by one leaky bucket with rate r i and 
burstiness b\ with 

»: = ».+n (t + I) 
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Note that this bound is slightly better than the one we used in Corollary 2.4. 1 
onPage 106 (but the assumptions are slightly different). The bound is also better than 
if we apply Theorem 6.2.1. Indeed, in that case, we would obtain the rate-latency 
service curve with the same rate R — r 2 but with a larger latency: T + instead 
of T -b The gain is due to the FIFO assumption. 



6.5 Bounds for a Network of FIFO CBR Servers 

When analyzing a global network, we can use the bounds in Section 6.4, using the 
same method as in Section 2.4. However, as illustrated in [36], the bounds so ob- 
tained are not optimal: indeed, even for a FIFO ring, the method does not find a finite 
bound for all utilization factors less than (although we know from Section 6.3.2 that 
such finite bounds exist). 

In this section is Theorem 6.5.1, we show in Theorem 6.5.1 some partial result 
that goes beyond the per-node bounds in Section 6.4. The result was originally found 
in [13, 46, 75]. 

6.5.1 Closed Form Bounds for an ATM Network with Strong 
Source Rate Conditions 

Consider an ATM network with the assumptions as in Section 6.3.1, with the fol- 
lowing differences 

• Every link has one origin node and one end node. We say that a link / is 
incident to link e if the origin node of link e is the destination node of link /. 
In general, a link has several incident links. 

• All packets have the same size (called cell). All arrivals and departures oc- 
cur at integer times (synchronized model). All links have the same bit rate, 
equal to 1 cell per time unit. The service time for one cell is 1 time unit. The 
propagation times are constant per link and integer. 

• All links are FIFO. 

Proposition 6.5.1. For a network with the above assumption, the delay for a cell c 
arriving at node e over incident link i is bounded by the number of cells arriving on 
incident links j i during the busy period, and that will depart before c. 

Proof: Call R'{f) (resp. Rj{t), R{t))the output flow (resp. input arriving on link 

j, total input flow). Call d the delay for a tagged cell arriving at time t on link i. 
Call Aj the number of cells arriving on link j up to time t that will depart before the 
tagged cell, and let A = Aj . We have 



d = A- R'{t) <A- R{s) -{t-s) 
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where s is the last time instant before the busy period at t. We can rewrite the 
previous equation as 

d < - s) 

Now the link rates are all equal to 1, thus Ai — Ri{s) <t — s and 

d < — J?j(s)] 



□ 

An “Interference Unit” is defined as a set (e, {j, k}) where e is a link, {j, k} is a 
set of two distinct flows that each have e on their paths, and that arrive at e over two 
different incident links (Figure 6.1). The Route Interference Number (RIN) of flow 
j is the number of interference units that contain j . It is thus the number of other 
flows that share a common sub-path, counted with multiplicity if some flows share 
several distinct sub-paths along the same path. The RIN is used to define a sufficient 
condition, under which we prove a strong bound. 



flowi 




Figure 6.1: The network model and definition of an interference unit. Fiows j 
and have an interference unit at node /. Flows j and ii have an interfer- 
ence unit at node I and one at node g. 



Definition 6.5.1 (Source Rate Condition). The fresh arrival curve constraint (at 
network boundary ) for flow j is the stair function where R is the RIN of 

flow j. 

The source rate condition is equivalent to saying that a flow generates at most 
one cell in any time interval of duration RIN -|- 1. 

Theorem 6.5.1. If the source rate condition holds at all sources, then 

1. The backlog at any node is bounded by N — max j Ni, where Ni is the number 
of flows entering the node via input link i, and N = Ni. 
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Figure 6.2; A time-space diagram illustrating the definitions of d 4g ci and 
Cl 4f C 2 - Time flows downwards. Rectangles illustrate busy periods. 



2. The end-to-end queuing delay for a given flow is bounded by its RIN. 

3. There is at most one cell per flow present during any busy period. 

The proof of item 3 involves a complex analysis of chained busy periods, as 
does the proof of Theorem 6.3.1. It is given in Section 6.5.2. Item 3 gives an intu- 
itive explanation of what happens: the source rate condition forces sources to leave 
enough spacing between cells, so that two cells of the same flow do not interfere, 
in some sense. The precise meaning of this is given in Section 6.5.2. Items 1 and 2 
derive from item 3 by a classical network calculus method (Figure 6.4). 

6.5.2 Proof of Theorem 6.5.1 

As a simplification, we call “path of a cell" the path of the flow of the cell. Similarly, 
we use the phrase “interference unit of c” with the meaning of interference unit of 
the flow of c. 

We define a busy period as a time interval during which the backlog for the flow 
at the node is always positive. We now introduce a definition (super-chain) that will 
be central in the proof. First we use the following relation: 

Definition 6.5.2 (“Delay Chain” [13]). For two cells c and d, and for some link e, 
we say that c =^e d if c and d are in the same busy period at e and c leaves e before 
d. 



Figure 6.2 illustrates the definition. 
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Definition 6.5.3 (Super-Chain [13]). Consider a sequence of cells c = (cq, ..., 
Cj, ..., Ck) and a sequence of nodes f = (/i, ..., fk)- We say that (c, /) is a super- 
chain if 

• fi, ■■■, fk are all on the path P of cell cq (but not necessarily consecutive) 

• Ci-i ^f. Cifori = 1 to k. 

• the path of cell Cifrom f to is a sub-path of P 

We say that the suh-path of cq that spans from node /i to node fk is the path of 
the super-chain. 

Definition 6.5.4 (Segment Interfering with a Super-Chain). For a given super- 
chain, we call “segment" a couple (d, P) where P is a sub-path of the path of the 
super-chain, d is a cell whose path also has P as a sub-path, and P is maximal 
( namely, we cannot extend P to be a common sub-path of both d and the super- 
chain). We say that the segment {d, P) is interfering with super-chain (c, /) if there 
is some i on P such that d =^f. Ci. 

Lemma 6.5.1. Let (c, /) be a super-chain. Let sq be the arrival time of cell cq at 
link fi and s'j, the departure time of cell Ckfrom link fk. Then s'j, — so < R\^k +2^1,*. 
where R\^k ^ the total number of segments interfering with (c, /) and T\^k R the 
total transmission and propagation time on the path of the super-chain. 

Proof: Consider first some node fj on the super-chain. Let sj-i (resp. tj) he the 

arrival time of cell cj-i (resp. cj) at the node. Let t'j_i (resp. s'j) he the departure 
time of cell cj-i (resp. Cj) (Figure 6.3). Let vj he the last time slot before the busy 
period that tj is in. By hypothesis, Uj -I- 1 < Sj_i . Also define Bj (resp. Bj) as the 




Figure 6.3: The notation used in the proof of Lemma 6.5.1 . 



set of segments {d, P) where d is a cell arriving at the node after time vj on a link 
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incident to the path of the super-chain (resp. on the path of the super-chain) and that 
will depart no later than cell cj, and where P is the maximal common sub-path for 
d and the super-chain that fj is in. Also define Aj as the subset of those segments 
in Bj for which the cell departs after cj-i . Let Bj (resp. Bj ,A*j) be the number of 
elements in Bj (resp. Bj,A^), see Figure 6.3. 

Since the rate of all incident links is 1, we have 

Bj -A°< Sj-i - Vj 

Also, since the rate of the node is 1, we have: 

Sj — Vj = Bj + Bj 



Combining the two, we derive 

Sj — Sj—i = Bj T Bj — {sj—i — Vj') < Bj T Aj (6.16) 

By iterative application of Equation (6.16) from j = 1 to k, we obtain 

k 

s'k~ So < "^{Bj + A^) + Ti^u 

i=i 

Now we show that all sets in the collection {Bj, A^, j = 1 to k} are two-by-two 
disjoint. Firstly, if {d, P) € Bj then fj is the first node of P thus {d, P) cannot be in 
some other Bj' with j ^ j' . Thus the Bj are two-by-two disjoint. Second, assume 
{d, P) e Bj and {d, P) € Aj, . It is obvious from their definitions that, for a fixed 
j, Bj and Aj are disjoint; thus j ^ j' . Since fj is the first node of P and j' is on P, 
it follows that j < j' . Now d leaves fj before Cj and leaves fj, after Cj<_i, which 
contradicts the FIFO assumption. Thus the Bj and Aj, are two-by-two disjoint. The 
same reasoning shows that it is not possible that {d, P) € Aj fj Aj, with j < j' . 

Now, by definition, every segment in either Bj or Aj is an interfering segment. 
Thus 

k 

Y^{Bj + A^)<R,^u 



Proposition 6.5.2. Assume the source rate condition holds. Let (c, /) be a super- 
chain. 

1. For every interference unit of co there is at most one cell interfering with the 
super-chain. 

2. Ck does not belong to the same flow as cq- 
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Proof: Define the time of a super-chain as the exit time for the last cell Ck on the 

last node fj. . We use a recursion on the time t of the super-chain. 

If f = 1, the proposition is true because any flow has at most one cell on a link 
in one time slot. Assume now that the proposition holds for any super-chain with 
time < f — 1 and consider a super-chain with time t. 

First, we associate an interference unit to any segment {d, P) interfering with 
the sub-chain, as follows. The paths of d and cq may share several non contiguous 
sub-paths, and P is one of them. Call / the first node of P. To d we associate the 
interference unit (/, {jo,j}), where jo (resp. j) is the flow of cq (resp. d). 

We now show that this mapping is injective. Assume that another segment 
{d',P') ^ (d,P) is associated with the same interference unit (/, {jo, j})- Without 
loss of generality, we can assume that d was emitted before d' . d and d' belong to 
the same flow j, thus, since P and P' are maximal, we must have P = P' . By hy- 
pothesis, have an interference with the super-chain at a node on P. Let // be a node 
on the super-chain and on P such that d /, c/ . If d' leaves node // before c/ , then 
d =^/, d' , and thus ( (d, d' ),(//)) is a super-chain. Since d' is an interfering cell, nec- 
essarily, it must leave node // before t, thus the proposition is true for super-chain 
( (d, d' ),(//)) , which contradicts item 2. Thus d' must leave node // after cell c/ . But 
there is some other index m <k such that d Cm, thus cell d' leaves node /„ 
before cell Cm. Define I' as the smallest index with I < V < m such that d' leaves 
node /// after cell c//_i and before cv . Then ((d, ci, ..., c//_i, d'), (//,••, //')) is a 
super-chain with time < f — 1 which would again contradict item 2 in the proposi- 
tion. Thus, in all cases we have a contradiction, the mapping is injective, and item 1 
is shown for the super-chain. 

Second, let us count a bound on the maximum queuing delay of cell cq. Call 
uo its emission time, Pq the sub-path of cq from its source up to, but excluding, 
node /i, and T the total transmission and propagation time for the flow of cq. The 
transmission and propagation time along Pq is thus T — Ti ^,. By Proposition 6.5.1, 
the queuing delay of cq at a node / on Pq is bounded by the number of cells d cq 
that arrive on a link not on Pq. By the same reasoning as in the previous paragraph, 
there is at most one such cell d per interference unit of cq at /. Define R as the 
number of interference units of the flow of cq on Pi . We have thus 

So < Uo + R + T — (6.17) 

Similarly, from Lemma 6.5.1, we have 

sJi; < So -f Ri,k + 

Call R' the number of interference units of the flow of cq on the path of the super- 
chain. It follows from the first part of the proof that Ri^k < 7?', thus 

s'k ^ So + R' + Ti^k 
Combining with Equation (6.17) gives 

Sk— ^0 T P T R' T T 



( 6 . 18 ) 
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Now by the source condition, if Ck belongs to the flow of cq, its emission time u' 
must satisfy 

^ Uo -h R ~h R' 1 

and thus 

R is! + 1 + T' 

which contradicts Equation (6.18). This shows that the second item of the proposi- 
tion must hold for the super-chain. □ 

Proof of Theorem 6.5.1: Item 3 follows from Proposition 6.5.2, since if there 

would be two cells d, d' of the same flow in the same busy period, then {{d, d'), (e)) 
would be a super-chain. 

Now we show how items 1 and 2 derive from item 3. Call a*{t) the maximum 
number of cells that may ever arrive on incident link i during t time units inside 
a busy period. Since Ai is a service curve for node e, the backlog B at node e is 
hounded hy 

■ I 

-t 

.*=1 

Now by item 3, a* (t) < Ni and thus 

a*{t) < ai{t) := min[A^,,f] 



B < sup 

i>0 



Thus 

■ / 

-t 

.*=1 

Now define a renumbering of the Ni’s such that < N^ 2 ) < ••• < ^(i)- 
function — t is continuous and has a derivative at all points except the 

Nfi'j’s (Figure 6.4). The derivative changes its sign at (=maxi<,</(A^,)) thus 
the maximum is at and its value is , which shows item 1 . 

From Item 1 , the delay at a node is bounded by the number of interference units 
of the flow at this node. This shows item 2. □ 



B < sup 

i>0 



6.6 Bibliographic Notes 

In [46], a stronger property is shown than Theorem 6.5.1: Consider a given link 
e and a subset A of m connections that use that link. Let n be a lower bound on 
the number of route interferences that any connection in the subset will encounter 
after this link. Then over any time interval of duration m + n, the number of cells 
belonging to A that leave link e is bounded by m. 

It follows from item 1 in Theorem 6.5.1 that a better queuing delay bound for 
flow j is: 
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Figure 6.4: Derivation of a backlog bound. 



^U) = 



E 

e such that eEj 



i such that 



(Nie) - Ni(e)) 



where 7(e) is the number of incident links at node e, Ni{e) is the number of flows 
entering node e on link i, and N = In other words, the end- 

to-end queuing delay is bounded by the sum of the minimum numbers of route 
interference units for all flows at all nodes along the path of a flow. For asymmetric 
cases, this is less than the RIN of the flow. 



6.7 Exercises 

Exercise 6.1. Consider the same assumptions as in Section 6.3.2 but with a linear 
network instead of a ring. Thus node m feeds node m + 1 for m = 1 , . . . , M — 1; node 
1 receives only fresh traffic, whereas all traffic exiting node M leaves the network. 
Assume that all service curves are strict. Find a bound which is finite for u < 1. 
Compare to Theorem 6.3.1. 

Exercise 6.2. Consider the same assumptions as in Theorem 6.5.1. Show that the 
busy period duration is bounded by N . 



Chapter 7 

Adaptive and Packet Scale 
Rate Guarantees 



7.1 Introduction 

In Chapter 1 we defined a number of service curve concepts: minimum service 
curve, maximum service curve and strict service curves. In this chapter we go be- 
yond and define some concepts that more closely capture the properties of general- 
ized processor sharing (GPS). 

We start by a motivating section, in which we analyze some features of ser- 
vice curves that do not match GPS. Then we provide the theoretical framework of 
adaptive guarantees, which was first proposed in Okino’s dissertation in [55] and by 
Agrawal, Cruz, Okino and Rajan in [1]. This framework is underlying the concept 
of packet scale rate guarantees, which is used in the definition of the Internet Ex- 
pedited Forwarding service. We explain the relationship between the two and give 
practical applications. 

In all of this chapter, we assume that flow functions are left-continuous, unless 
stated otherwise. 



7.2 Adaptive Guarantee 

7.2.1 Limitations of the Service Curve Abstraction 

The definition of service curve introduced in Section 1.3 is an abstraction of nodes 
such as GPS and its practical implementations, as well as guaranteed delay nodes. 
This abstraction is used in many situations, described all along this book. However, 
it is not always sufficient. 

Firstly, it does not provide a guarantee over any interval. Consider for example 
a node offering to a flow R{t) the service curve Ac- Assume R{t) = B for f > 0, 
so the flow has a very large burst at time 0 and then stops. A possible output is 
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illustrated on Figure 7.1. It is perfectly possible that there is no output during the 
time interval (0, even though there is a large backlog. This is because the 

server gave a higher service than the minimum required during some interval of 
time, and the service property allows it to be lazy after that. 




Figure 7.1: The service curve property is not sufficient. 



Secondly, there are case where we would like to deduce a bound on the delay 
that a packet will suffer given the backlog that we can measure in the node. This is 
used for obtaining bounds in FIFO systems with aggregate scheduling. In Chapter 6 
we use such a property for a constant delay server with rate C: given that the backlog 
at time t is Q, the last bit present at time t will depart before within a time of 0. If 
we assume instead that the server has a service curve A c , then we cannot draw such 
a conclusion. Consider for example Figure 7.1: at time t > 0, the backlog, e, can be 
made arbitrily small, whereas the delay — t can be made arbitrarily large. 

A possible fix is the use of strict service curve, as defined in Definition 1.3.2 
on Page 27. Indeed, it follows from the next section (and can easily be shown in- 
dependently) that if a FIFO node offers a strict service curve /3, then the delay at 
time t is bounded by ((3(f)), where Q(t) is the backlog at time t, and is the 
pseudo-inverse (Definition 3.1.7 on Page 129). 

We know that the GPS node offers to a flow a strict service curve equal of the 
form Xji- However, we cannot model delay nodes with a strict service curve. Con- 
sider for example a node with input R{t) = et, which delays all bits by a constant 
time d. Any interval [s, t] with s > dis within a busy period, thus if the node offers 
a strict service curve (3 to the flow, we should have (3{t — s)e{t — s), and e can 
be arbitrarily small. Thus, the strict service curve does not make much sense for a 
constant delay node. 



7.2.2 Definition of Adaptive Guarantee 

We know introduce a stronger concept, called adaptive guarantee, that better cap- 
tures the properties of GPS [55, 1]. Before giving the formula, we motivate it on 
three examples. 
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Consider first a node offering a strict service curve /3. Consider some fixed, but 
arbitrary times s < t. Assume that fl is continuous. If [s, t] is within a busy period, 
we must have 

R*{t) > R*{s) + I3{t - s) 

Else, call u the beginning of the busy period at t. We have 

R*{t) > R{u)+l3{t-u) 



thus in all cases 

R*{t)>{R*{s)+l3{t-s))A inf {R{u) + l3{t - u)) (7.1) 

Second, consider a node that guarantees a virtual delay < d. If t — s < d then 
trivially 

R*{t) > R*{s) +Sd{t - s) 

and ift — s>d then the virtual delay property means that 

R*{t) > R{t — d) = inf {R{u) + 5d{t — u)) 

ue[s,t] 

thus we have the same relation as in Equation (7.1) with (i = 84 . 

Thirdly, consider a greedy shaper with shaping function a (assumed to be a good 
function). Then 

R*{t) = inf [i?(u) + a{t — u)] 

U<t 

Breaking the inf into u < s and u > s gives 

R* (t) = mf[R{u) + a{t — u)] A inf [R{u) + a{t — u)] (7.2) 



Define a := a 0 a, namely. 



(t(u) = mf[a(t + u) — (7(u)] 



(7.3) 



Eor example, for a piecewise linear concave arrival curve (conjunction of leaky 
buckets), a{t) = min,(rjU + 6,), we have a{u) = min, ViU. Back to Equation (7.2), 
we have 

a(t — u) > a(s — u) + a(t — s) 

and finally 



R*{t)>{R*{s) + d{t-s))A inf {R{u) + (j{t - u)) (7.4) 

«G[s,i] 



We see that these three cases fall under a common model: 
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Definition 7.2.1 (Adaptive Service Curve). Let I3,l3 be in T. Consider a system 
S and a flow through S with input and output functions R and R*. We say that S 
offers the adaptive guarantee (/3, /3) if for any s <t it holds: 

R* {t) > { R* {s) + j3{t — s)\ A inf \R{u) + fi{t — u)] 

If (3 = (3 we say that the node offers the adaptive guarantee f3. 

The following proposition summarizes the examples discussed above: 

Proposition 7.2.1. * If S offers to a flow a strict service curve (3, then it also 

offers the adaptive guarantee (3. 

• If S guarantees a virtual delay bounded by d, then it also offers the adaptive 
guarantee Sd 

• A greedy shaper with shaping curve a, where a is a good function, offers the 
adaptive guarantee {a, a), with d defined in Equation (7.3). 

Similar to [55], we use the notation R {(3,(3) -A- R* to express that Defini- 
tion 7.2.1 holds. If /3 = /3 we write R ^ (fl) ^ R* . 

Assume that R is left-continuous and (3 is continuous. It follows from Theo- 
rem 3.1.8 on Page 139 that the adaptive guarantee is equivalent to saying that for all 
s < f, we have either 

R*{t)-R*{s) >(3{t-s) 
or 

R*{f) > R{u)+(3{t-u) 

for some u € [s, i]. 

7.2.3 Properties of Adaptive Guarantees 

Theorem 7.2.1. Let R {(3,(3) R*. If (3 < (3 then (3 is a minimum service 

curve for the flow. 

Proof: Apply Definition 7.2.1 with s = 0 and use the fact that (3 < (3. □ 

Theorem 7.2.2 (Concatenation). If R ^ {(3\,(3\) R\ and R\ {(32,(32) 

R* then R ^ {(3, (3) R* with 

/3 = (^/3i ® (32^ A 02 



and 



(3 = (3i® (32 
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Proof: Consider some fixed but arbitrary times s < t and let u G [s, i]- We have 

Ri(u) > (s) + /3(u — s)l A inf [R(v) + l3i(u — v)] 

thus 

i?i (u) + ^ 2(1 - u) > |^i?i (s) + P{u - s) + ^ 2(1 - w) j A 

inf^G[«.«] +Pi{u-v) + fl 2 {t - u)] 

and 



inf [i?i (u) +^ 2 ( 1 - m)] > 

inf \Ri{s) + P{u - s) + fl 2 {t - u)] 

«G[s,i] L J 

A inf [R{v) + Pi{u - v) + ^ 2(1 - u)] 



After re-arranging the infima, we find 

inf [i?i (u) + (32 (t - u)] > 

^/G[5,r] 



«JG[s,i] 

which can be rewritten as 



Ri{s) + inf \l3{u - s) + I32{t - u) 
inf |i?(r;)-|- inf [l3i{u - v) + l32{t 

uE[v,t] 




inf [i?i (u) + (32{t - m)] > 

(^Ri{s) + 01 ® I32){t - s)j A 
inf [R{v) + l3{t — v)] 

ve[s,t] 

Now by hypothesis we have 

R*{t) > (r*{s) + 02{t - s)') A inf [i?(u) + f32{t - u)] 

\ / «G[s,i] 



Combining the two gives 



R*{t) > 

(^R*{s) +/32{t - s)j A (^Ri{s) + 01 ® I32){t - s)j 
A inf + l3{t — r;)] 



Now Ri (s) > R* (s) thus 
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R*{t) > 

(^R*{s) + /32{t- s)j A (^R*{s) + 01 ® j32)(t - s)j 
A inf [R{v) + l3{t — i;)] 



□ 

Corollary 7.2.1. IfRi-i 0i,Pi) Rifori = Iton then Ro 0,0 Rn 
with 

j3= ^/fi ® /32 ® ■■■ ® /3n) A {02 ... ® A ... A {0n-l ® Pn'j A 0n 

and 

13 = 0® ... ®!3n 



Proof: Apply Theorem 7.2.2 iteratively and use Rule 6 in Theorem 3.1.5 on 

Page 135. □ 

Theorem 7.2.3 (Delay from Backlog). IfR 0,0 R*, then the virtual delay 

at time t is bounded by j3~^ (Q(t)), where Q(t) is the backlog at time t, and j3~^ is 
the pseudo-inverse of (3 (see Definition 3.1.7 on Page 129). 

Note that if the node is FIFO, then the virtual delay at time t is the real delay for 
a bit arriving at time t. 

Proof: If the virtual delay at time t is larger than f + r for some r > 0, then we 

must have 

R* (t -I t) < R(t) (7.5) 

By hypothesis 

R* (t -I t) > ( R* (t) -I 0 t)\ A inf + /3(f + r — u)l (7.6) 

\ 1 [ue[t,t+T] 

now for u e [t, t + r] 

R{u) + 0t + r — u) > R{t) + /3(0) > R*(t -I r) 
thus Equation (7.6) implies that 

R*(t + T) > R*(f)+0T) 

combining with Equation (7.5) gives 



Q{f) = R{f) - R*{f) >00 
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thus the virtual delay is hounded by sup{r : fI{T) > Q{t)} which is equal to 
p-HQ(t)). □ 

Consider a system {bit-by-bit system) with L-packetized input R and bit-by-bit 
output R*, which is then L-packetized to produce a final packetized output R' . We 
call combined system the system that maps R into R' . Assume both systems are 
first-in-first-out and lossless. Remember from Theorem 1.7.1 that the per-packet 
delay for the combined system is equal the maximum virtual delay for the bit-by-bit 
system. 

Theorem 7.2.4 (Packetizer and Adaptive Guarantee). If the bit-by-bit system 
offers to the flow the adaptive guarantee (/3, jS), then the combined system offers to 
the flow the adaptive guarantee (/)', f)') with 

P'{t) = 0{t) - )„,ax] + 

and 

13' (t) = [I3(t) - )„,ax] + 

where /max E the maximum packet size for the flow. 

Proof: Let s < f . By hypothesis we have 

R*{t) > {r*{s) -\- j3{t — t\ inf [R{u) -\- !3{t — u)] 

\ / aG[s,i] 

We do the proof when the inf in the above formula is a minimum, and leave it to the 
alert reader to extend it to the general case. Thus assume that for some uq € [s, f]: 

inf [R(u) -\- (3{t — u)] = R(uo) + (3{t — uf) 

ue[s,t] 

it follows that either 

R*(f) -R*{s) >i3(t-s) 

or 

R*{f) > R(uo) + (3{t - uo) 

Consider the former case. We have R'{f) > R*{f) — /max and R'{s) < R*{s) thus 
R'(t) > R*(t) - /max > R'(s) -h l3(t -s) - /max 
Now also obviously R'{t) > R'{s), thus finally 

R'{t) > R'{s) -\- max[0,/3(f — s) — /max] = R'{s) + j3'{t — s) 

Consider now the latter case. A similar reasoning shows that 
R'(t) > R(uo) + l3(t - Uo) - /max 



but also 
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R*(t) > R(uo) 

now the input is L-packetized. Thus 

R'(t) = P^(R*(t)) > P^(R(uo)) = R(uo) 

from which we conclude that R'(t) > R{uq) + — uq). 

Combining the two cases provides the required adaptive guarantee. □ 

7.3 Application to the Internet: Packet Scale Rate 
Guarantee 

In this section we apply the concept of adaptive guarantee to practical schedulers 
used in the Internet. 

7.3.1 Definition of Packet Scale Rate Guarantee 

In Section 2.1.3 on Page 86 we have introduced the definition of guaranteed rate 
scheduler, which is the practical application of rate latency service curves. Consider 
a node where packets arrive at times A 1 > 0 ,^ 2 , ••• and leave at times T>i , C> 2 , .... 
A guaranteed rate scheduler, with rate r and latency v requires that D , <T'+v, 
where T/ is defined iteratively by Tq = 0 and 

T- = max{Aj,T/_i) + ^ 
where 1, is the length of the ith packet. 

A packet scale rate guarantee is similar, but, much in the spirit of adaptive guar- 
antees, avoids the limitations of the service curve concept discussed in Section 7.2.1. 
To that end, we would like that the deadline T- is reduced whenever a packet hap- 
pens to be served early. This is done by replacing T-_^ in the previous equation by 
min{Tj', T>,}. This gives the following definition. 

Definition 7.3.1 (Packet Scale Rate Guarantee). Consider a node that serves a 
flow of packets numbered i = 1 , 2 ,.... Call Ai,Di,li the arrival time, departure 
time, and length in bits for the ith packet, in order of arrival. Assume Ai > O.We 
say that the node offers to the flow a packet scale rate guarantee with rate r and 
latency v if the departure times satisfy 



Di < Fi+v 



where Fi is defined by: 

( Fo = Do = 0 

\ Fj = max{Aj,min(Oj_i,Fj_i)} -b ^ for alH > 1 



(7.7) 
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We now relate packet scale rate guarantee to an adaptive guarantee. We cannot 
expect an exact equivalence, since a packet scale rate guarantee does not specify 
what happens to bits at a time other than a packet departure or arrival. However, the 
concept of packetizer allows us to establish an equivalence. 

Theorem 7.3.1 (Equivalence with adaptive guarantee). Consider a node S with 
L-packetized input R and with output R*. 

1. If R {ff) R* , where (3 = (3r,v is the rate-latency function with rate r 

and latency v, and if S is FIFO, then S offers to the flow the packet scale rate 
guarantee with rate r and latency v. 

2. Conversely, if S offers to the flow the packet scale rate guarantee with rate 
r and latency v and if R* is L-packetized, then S is the concatenation of a 
node S' offering the adaptive guarantee (3r,v tind the L-packetizer. If S is 
FIFO, then so is S'. 

The proof is long and is given in a separate section (Section 7.3.3). Note that the 
packet scale rate guarantee does not mandate that the node be FIFO; it is possible 
that Di < Di-i in some cases. However, part 1 of the theorem requires the FIFO 
assumption in order for a condition on R,R* to be translated into a condition on 
delays. 

A special case of interest is when r; = 0. 

Corollary 7.3.1. Consider a node with L-packetized input. Call Ai, the arrival 
and departure times for packet i, with i = 1, 2, ... and A\ > 0. Let f be the size of 
packet i. 

1. If the node guarantees a strict service curve and is FIFO then 

f -^0 = 0 

( < max {Aj, ^ for alH > 1 

2. Conversely if Equation (7.8) holds for all i, and if the output is L-packetized, 
then the node is the concatenation of a node guaranteeing a strict service 
curve Xr and an L-packetizer. 



Proof: Apply Theorem 7.3.1 with r; = 0 and note that T>,_i < F,_i in Equa- 
tion (7.7). □ 

Definition 7.3.2. We call minimum rate server, with rate r, a node for which Equa- 
tion (7.8) holds for all i 

Thus, roughly speaking, a minimum rate server guarantees that during any busy 
period, the instantaneous output rate is at least r. A GPS node with total rate C and 
weight Wi for flow i is a minimum rate server for flow i, with rate r , = . 

Since a packetizer does not add increase the per-packet delay, we can immedi- 
ately derive the following property from Theorem 7.2.3 and Theorem 7.3.1: 
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Proposition 7.3.1 (Backlog from Delay). For a FIFO node offering the packet 
scale rate guarantee with rate r and latency v, the delay for a packet present in the 
system at time t is bounded by + v, where Q (f) is the backlog at time t. 

Lastly, we have a concatenation result for FIFO systems: 

Proposition 7.3.2. Consider a concatenation of FIFO systems numbered 1 to n. 
The output of system i — 1 is the input of system i, for i > 1. Assume system i offers 
the packet scale rate guarantee with rate Ri and latency Ei. The global system 
offers the packet scale rate guarantee with rate R = minj^i and latency 

Proof: By Theorem 7.3.1-(2), we can decompose system i into a concatenation 

Si, Vi, where Si offers the adaptive guarantee (3 Ri,Ei and Vi is a packetizer. 

Call S the concatenation 

Sl,Vi,S2,V2, ■■■,Sn-l,Vn-l,Sn 

By Theorem 7.3.1-(2), S is FIFO. By Theorem 7.2.4, it provides the adaptive guar- 
antee (3r^e- By Theorem 7.3.1-(1), it also provides the packet scale rate guarantee 
with rate R and latency E. Now Vn does not affect the finish time of the last hit of 
every packet. 

□ 



7.3.2 Practical Realization of Packet Scale Rate Guarantee 

We show in this section that a wide variety of schedulers provide the packet scale 
rate guarantee. More schedulers can be obtained by using the concatenation theorem 
in the previous section. 

A simple but important realization is the priority scheduler. 

Proposition 7.3.3. Consider a non-preemptive priority scheduler in which all pack- 
ets share a single FIFO queue with total output rate C. The high priority flow re- 
ceives a packet scale rate guarantee with rate C and latency v = where Imax 
is the maximum packet size of all low priority packets. 

Proof: By Proposition 1.3.7, the high priority traffic receives a strict service curve 

(3r,c- □ 

We have already introduced in Section 2.1.3 a large number of schedulers that 
can be thought of as derived from GPS and we have modeled their behaviour with 
a rate-latency service curve. In order to give an adaptive guarantee for such sched- 
ulers, we need to define more. 

Definition 7.3.3 (Accuracy of a scheduler with respect to rate r). Consider a 
scheduler S and call Di the time of the i-th departure. VFe say that the accuracy of 
S with respect to rate r is (vi , V 2 ) if there is a minimum rate server with rate r and 
departure times Gi such that for all i 
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Gi — vi < Di < Gi + V2 (7.9) 

We interpret this definition as a comparison to a hypothetical GPS reference 
scheduler that would serve the same flows. The term V 2 determines the maximum 
per-hop delay bound, whereas v\ has an effect on the jitter at the output of the 
scheduler. For example, it is shown in [5] that WF^Q satisfies ui(WF^Q) = Imax/'r, 
U 2 (WF^Q) = Imax/G, where Imax is maximum packet size and C is the total 
output rate. In contrast, for POPS [57] U 2 (POPS) = V 2 (WF^Q), while vi (POPS) is 
linear in the number of queues in the scheduler. This illustrates that, while WF^Q 
and POPS have the same delay bounds, POPS may result in substantially burstier 
departure patterns. 

Theorem 7.3.2. If a scheduler satisfies Equation (7.9), then it offers the packet scale 
rate guarantee with rate r and latency v = v\ + V 2 - 



Proof: We hrst prove that for alH > 0 

Fi>Gi-vi (7.10) 

where F, is defined by Equation (7.7). Indeed, if Equation (7.10) holds, then by 
Equation (7.9)): 

Di < Gj + U2 < Fj + ui + U2 

which means that the scheduler offers the packet scale rate guarantee with rate r and 
latency v = v\ + V 2 - 

Now we prove Equation (7.10) by induction. Equation (7.10) trivially holds for 
i = 0. 

Suppose now that it holds for i — 1, namely, 

Fi-i > Gi-i — vi 
By hypothesis. Equation (7.9) holds: 

Di-i > Gi-i — vi 

thus 

min[Fj_i,Fj_i] > Gj_i - vi (7.11) 

Combining this with Equation (7.7), we obtain 

Fi > Gi-i — v\ -{ — (7.12) 
Again from Equation (7.7) we have 



Fi > ^ 

> A, — ^ 



(7.13) 



Now by Equation (7.8) 
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Gi < m3.x[Ai,Gi-i] + - (7.14) 

r 

Combining Equation (7.12)), Equation (7.13)) and (7.14) gives 

Fi ^ Gi — vi 



□ 



7.3.3 Proof of Theorem 7.3.1 

The first part of Theorem 7.3.1 is based on a max-plus representation of the packet 
scale rate guarantee, which maps the (min-plus) definition of an adaptive guarantee. 
The second part relies on the reduction to the minimum rate server. 

We use the same notation as in Definition 7.3.1. L{i) = Ij is the cumula- 

tive packet length. 



Part 1: Define the sequence of times Fj. by Equation (7.7). Consider now some 

fixed but arbitrary packet index i > 1. By the FIFO assumption, it is sufficient to 
show that 

R*{t)>L{i) (7.15) 

with t = Fi + V. Define 



j = max {A: G {1, : Au > Du-i or Au < Du-i < Fu-i) 

Note that the set above is non-empty and 1 < j < T The definition of j implies 



> Dj-i or Aj < Dj-i < Fj-i (7.16) 

and 

Ak < Dk-i and Fk~i < Dk~i iov j + 1 <k <i (7.17) 

Note that the set of indices k to which the previous equation applies may be empty 
(in that case, j = i). 

By Equation (7.16) and the definition of F j, we have 

(7.18) 

r 



with 



s = AjV Dj_i 



Similarly, we derive from Equation (7.17) that for j -|- 1 < A: < i: 



Fk — {Ak V Fk-i) -\ 

r 



which can be re-written as 




1.3. PACKET SCALE RATE GUARANTEE 



239 



Pk — [ 3 1 V ( Fk-1 + 



^k 



(7.19) 



Now we obtain a max-plus expansion of F, as follows. We substitute F,_i from 
Equation (7.19) at A: = i — 1 into Equation (7.19) at A: = i and obtain 



Pi — [ ^i 3 1 V ( Ai-I + 



h + h-1 



V I Pi-2 + 



h + h-1 



We apply this iteratively until k = j at which step we use Equation (7.18) instead 
of Equation (7.19). We obtain finally: 



Pi — S + 



L{i) - L{j - 1) 



\J max ( Ak + 
V k^j+l V 



L{i) - L{k - 1) 



(7.20) 



Let us apply the definition of an adaptive guarantee to the time interval [s, f]: 

R*{t) >AaB 

with 

A := R* (s) + r{t — s — v)~^ and B := inf B{u) 

uE[s,t] 



where 



B{u) := (R{u) + r{t — u — u)"*") 



Eirstly, since s > Dj-i, we have R*{s) > L{j — 1). By Equation (7.20), F, > 
g ^ t>S+ q. Jt follows that 



t — s — V > 



L{{) - L{j - 1) 



and thus A > L{i). 

Secondly, we show that B > L(i) as well. Consider some u € [s, i\. lfu>Ai 
then R{u) > L(i) thus B(u) > L{i). Otherwise, u < A,; since s > Aj, it follows 
that Ak-i < u < Af. for some A: € {j -I- and R{u) = L{k — 1). By 

Equation (7.20), 

r 

thus 

L(i) - L(k - 1) 

t-u-v> — i - 

r 

It follows that B(u) > L{i) also in that case. Thus we have shown that B > L(i). 
Combining the two shows that R*(t) > L{i) as required. 
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Part 2: We use a reduction to a minimum rate server as follows. Let L>' := 

min{Di, Fi) for i > 0. By Equation (7.7) we have 

Ai<D\< max(^i, D'_i) + ^ (7.21) 

and 

D'i<Di<D'i+v (7.22) 

The idea of the proof is now to interpret D ' as the output time for packet i out of a 
virtual minimum rate server. Of course, we cannot use Corollary 7.3.1. 

Construct a virtual node TZ as follows. The input is the original input R{t). The 
output is defined as follows. The number of bits of packet i that are output up to 
time t is (f) , defined by 

( if t > d'{i) then = L{i) 

< else if a{i) <t< d'{i) then = [L{i) — r{d' {i) — f)]"'' 

[ else^j(f)=0 

so that the total output of TZ is Ri{t) = ipiif)- 

The start time for packet i is thus max[A,, 77' — and the finish time is 77'. 
Thus TZ is causal (but not necessarily FIFO, even if the original system would be 
FIFO). We now show that during any busy period, TZ has an output rate at least equal 
to r. 

Let t be during a busy period. Consider now some time t during a busy period. 
There must exist some i such that Ai <t<D[. Let i be the smallest index such that 
this is true. If > 77'_j then by Equation (7.21) D[ — t < and thus = r 
where is the derivative of tpi to the right. Thus the service rate at time t is at least 
r. 

Otherwise, Ai < D'i — 1. Necessarily (because we number packets in order of 
increasing A^s - this is not a FIFO assumption) < A,; since i is the smallest 
index such that Ai < t < 77', we must have t > 77'_j. But then D[ — t < j and 
the service rate at time t is at least r. Thus, node TZ offers the strict service curve 
and 

R (Xr) Ri (7.23) 

Now define node 77. Let 6{i) := Di — 77', so that 0 < 6{i) < E. The input of 
77 is the output of TZ. The output is as follows; let a bit of packet i arrive at time t\ 
we have 7 < 77' < 77,. The bit is output at time 7' = max[min[77j_i,77j],7 + <5j]. 
Thus all bits of packet i are delayed in 77 by at most and if 77j_i < Di they 
depart after Di. It follows that the last bit of packet i leaves 77 at time Di. Also, 
since t' >t,V is causal. Lastly, if the original system is FIFO, then 77j_i < Di, all 
bits of packet i depart after 77j_i and thus the concatenation of TZ and 77 is FIFO. 
Note that TZ is not necessarily FIFO, even if the original system would is FIFO. 

The aggregate output of 77 is 

- S(i)) > 7?i(7 - v) 

i>l 
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thus the virtual delay for V is bounded by v and 

i?i ^ ((5.) ^ i ?2 (7.24) 

Now we plug the output of V into an L-packetizer. Since the last bit of packet i 
leaves V at time Di, the final output is R* . Now it follows from Equation (7.23), 
Equation (7.24) and Theorem 7.2.2 that 

R — ^ (A 7 . $$ ^ R 2 



□ 



7.4 Bibliographic Notes 

The concept of adaptive service curve was introduced in Okino’s dissertation in [55] 
and was published by Agrawal, Cruz, Okino and Rajan in [1], which contains most 
results in Section 7.2.3, as well as an application to a window flow control problem 
that extends Section 4.3.2 on Page 178. They call /3 an “adaptive service curve” and 
/3 a “partial service curve”. 

The packet scale rate guarantee was first defined in a framework dependent of 
adaptive service guarantees in [4]. It serves as a basis for the definition of the Expe- 
dited Forwarding capability defined for the Internet. 



7.5 Exercises 

Exercise 7.1. Assume that R (/3, /3) R*. 

1. Show that the node offers to the flow a strict service curve equal to ® fl, 
where (3 is the sub-additive closure of (3 . 

2. If (3 = (3 is a rate-latency function, what is the value obtained for the strict 
service curve ? 

Exercise 7.2. Consider a system with input R and output R*. We call “input flow 
restarted at time t” the flow Rt defined for u > 0 by 

Rt(u) = R(t -1-u) — R*(t) = R(t,u] -I- Q(t) 

where Q{t) := R{t) — R*{t) is the backlog at time t. Similarly, let the “output flow 
restarted at time t” be the flow Rf defined for u > 0 by 

R*{u) = R*{t-l-u) - R*{t) 



Assume that the node guarantees a service curve (3 to all couples of input, output 
flows (Rt,Rl). Show that R ^ {(3) ^ R* . 




Chapter 8 



Time Varying Shapers 



8.1 Introduction 

Throughout the book we usually assume that systems are idle at time 0. This is not a 
limitation for systems that have a renewal property, namely, which visit the idle state 
infinitely often - for such systems we choose the time origin as one such instant. 

There are cases however where we are interested in the effect at time t of non 
zero initial conditions. This occurs for example for re-negotiable services, where the 
traffic contract is changed at periodic renegotiation moments. An example for this 
service is the Integrated Service of the IETF with the Resource reSerVation Protocol 
(RSVP), where the negotiated contract may be modified periodically [28]. A similar 
service is the ATM Available Bit Rate service (ABR). With a renegotiable service, 
the shaper employed by the source is time-varying. With ATM, this corresponds 
to the concept of Dynamic Generic Cell Rate Algorithm (DGCRA).. At renegoti- 
ation moments, the system cannot generally be assumed to be idle. This motivates 
the need for explicit formulae that describe the transient effect of non-zero initial 
condition. 

In Section 8.2 we define time varying shapers. In general, there is not much we 
can say apart from a direct application of the fundamental min-plus theorems in 
Section 4.3. In contrast, for shapers made of a conjunction of leaky buckets, we can 
find some explicit formulas. In Section 8.3.1 we derive the equations describing a 
shaper with non-zero initial buffer. In Section 8.3.2 we add the constraint that the 
shaper has some history. Lastly, in Section 8.4, we apply this to analyze the case 
where the parameters of a shaper are periodically modified. 

This chapter also provides an example of the use of time shifting. 



8.2 Time Varying Shapers 

We define a time varying shaper as follows. 
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Definition 8.2.1. Consider a flow R{t). Given a function of two time variables 
H{t,s), a time varying shaper /orcei' the output R * (t) to satisfy the condition 

R*{t) < H{t,s)+R*{s) 

for all s < t, possibly at the expense of buffering some data. An optimal time varying 
shaper, or greedy time varying shaper, is one that maximizes its output among all 
possible shapers. 

The existence of a greedy time varying shaper follows from the following propo- 
sition. 

Proposition 8.2.1. For an input flow R{t) and a function of two time variables 
H(t, s), among all flows R* < R satisfying 

R*{t) < H{t,s)+R*{s) 

there is one flow that upper bounds all. It is given by 

R*{t) = inf [H{t, s) + i?(s)] (8.1) 

5>0 



where H is the min-plus closure of H, defined in Equation (4.10) on Page 172. 



Proof: The condition defining a shaper can be expressed as 

{ R* < Ch{R*) 

\ R* <R 

where Ch is the min-plus linear operator whose impulse response is H (Theo- 
rem 4.1.1). The existence of a maximum solution follows from Theorem 4.3.1 and 
from the fact that, being min-plus linear, £h ts upper-semi-continuous. The rest of 
the proposition follows from Theorem 4.2. 1 and Theorem 4.3.1. □ 

The output of the greedy shaper is given by Equation (8.1). A time invariant 
shaper is a special case; it corresponds to H{s,t) = a{t — s), where a is the shaping 
curve. In that case we find the well-known result in Theorem 1.5.1. 

In general. Proposition 8.2.1 does not help much. In the rest of this chapter, we 
specialize to the class of concave piecewise linear time varying shapers. 

Proposition 8.2.2. Consider a set of J leaky buckets with time varying rates r j{f) 
and bucket sizes hj{f). At time 0, all buckets are empty. A flow R{t) satisfies the 
conjunction of the J leaky bucket constraints if and only if for all 0 < s < t: 

R(t) < H(t, s) + R(s) 



with 



H{t,s) 



min ibj(t) + 
i<j<J 




(8.2) 
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Proof: Consider the level of the jth bucket. It is the backlog of the variable ca- 

pacity node (Section 1.3.2) with cumulative function 




rj(u)du 



We know from Chapter 4 that the output of the variable capacity node is given by 



i?' (f) = - Mj(s) + R(s)} 



The jth leaky bucket constraint is 

R{t) - R'jit) < bj{t) 

Combining the two expresses the jth constraint as 

R{t) - R{s) < Mj{t) - Mj{s) + bj{t) 

for all 0 < s < f. The conjunction of all these constraints gives Equation (8.2). 

In the rest of this chapter, we give a practical and explicit computation of H 
for H given in Equation (8.2), when the functions r and bj{t) are piecewise 
constant. 



8.3 Time Invariant Shaper with Non-zero Initial Con- 
ditions 

We consider in this section some time invariant shapers. We start with a general 
shaper with shaping curve a, whose buffer is not assumed to be initially empty. Then 
we will apply this to analyze leaky bucket shapers with non-empty initial buckets. 

8.3.1 Shaper with Non-empty Initial Buffer 

Proposition 8.3.1 (Shaper with non-zero initial buffer). Consider a shaper sys- 
tem with shaping curve a. Assume that a is a good function. Assume that the initial 
buffer content is wq. Then the output R* for a given input R is 

R*(t) = a(t) A {-R(s) + wo + ~ s)} for alH > 0 (§ 3 ) 

Proof: First we derive the constraints on the output of the shaper, a is the shaping 

function thus, for all f > s > 0 

R*{t) <R*{s)-\-a{t-s) 

and given that the bucket at time zero is not empty, for any f > 0, we have that 



R*{t) < R{t) Wo 
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At time s = 0, no data has left the system; this is expressed with 

R*{t)<So{t) 



The output is thus constrained by 

i?* < ((T ® R*) A {R + Wo) A So 

where ® is the min-plus convolution operation, defined by (/ ® g) (t) = inf s f{s) + 
g{t — s). Since the shaper is an optimal shaper, the output is the maximum function 
satisfying this inequality. We know from Lemma 1.5.1 that 

R* = (T ® [{R + Wo) A <5o] 

= [<T ® (i? + Wo)] A [(T ® <5o] 

= |(T ® (i? + Wo)] A a 

which after some expansion gives the formula in the proposition. □ 

Another way to look at the proposition consists in saying that the initial buffer 
content is represented by an instantaneous burst at time 0. 

The following is an immediate consequence. 

Corollary 8.3.1 (Backlog for a shaper with non-zero initial buffer). The backlog 
of the shaper buffer with the initial buffer content wo is given by 

w(t) = (R(t) — a(t) + Wo) V sup {R(t) — R(s) — a(t — s)} (8.4) 

0<s<t 

8.3.2 Leaky Bucket Shapers with Non-zero Initial Bucket Level 

Now we characterize a leaky-bucket shaper system with non-zero initial bucket lev- 
els. 

Proposition 8.3.2 (Compliance with J leaky buckets with non-zero initial bucket 
levels). A flow S{t) is compliant with J leaky buckets with leaky bucket specifica- 
tions (rj,bj),j = 1, 2 ... J and initial bucket level g® if and only if 

S(t) — S(s) < min [vj ■ (t — s) -h bf for all 0 < s < t 
i<j<J 

S(t) < min \rj ■ t -i-bj — q^] for alH > 0 

- J J 31 — 

Proof: Apply Section 8.3.1 to each of the buckets. □ 

Proposition 8.3.3 (Leaky-Bucket Shaper with non-zero initial bucket levels). 

Consider a greedy shaper system defined by the conjunction of J leaky buckets 
(vj , bj), with j = 1, 2 ... J. Assume that the initial bucket level of the j-th bucket is 
Qj. The initial level of the shaping buffer is zero. The output R* for a given input R 
is 

i?*(t) = min[(T°(t), ((T ® i?)(t)] for alH > 0 (8.5) 
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where a is the shaping function 



a{u) = min {(Tj(u)} = min {rj • u + hj) 

1<J<J 



and <7^ is defined as 



(T^iu) = min jr,- • u + — o?| 

i<i<j J 



Proof: By Corollary 8.3.2 applied to 5 = R*, the condition that the output is 

compliant with the J leaky buckets is 

R*(t) — R*(s) < a(t — s) for all 0 < s < f 
R* {t) < for alH > 0 



Since a^(u) < cr(u) we can extend the validity of the first equation to s = 0. Thus 
we have the following constraint: 

R*(t) < [(cr ® R*) A(RA cr°)](f) 



Given that the system is a greedy shaper, R*(-) is the maximal solution satisfying 
those constraints. Using the same min-plus result as in Proposition 8.3.1, we obtain: 

R* = a igi (RA (T°) = ((T ® i?) A (<T ® (T°) 



As (T*^ < (T, we obtain 



R* = (a ^R) A cr° 



□ 

We can now obtain the characterization of a leaky-bucket shaper with non-zero 
initial conditions. 



Theorem 8.3.1 (Leaky-Bucket Shaper with non-zero initial conditions). Con- 
sider a shaper defined by J leaky buckets {r j , bj), with j = 1, 2 ... J (leaky-bucket 
shaper). Assume that the initial buffer level of is wq and the initial level of the jth 
bucket is q^. The output R* for a given input R is 

= min{(T°(t), Wo + inf {i?(u) -I- (r(t — u)}} for alH > 0 (g 6) 



with 



(T^iu) = min (r,- • u -\- b^ — o°) 

i<i<j J J' 



Proof: Apply Proposition 8.3.3 to the input R' = {R -\- wq) A So and observe that 

< a. □ 

An interpretation of Equation (8.6) is that the output of the shaper with non-zero 
initial conditions is either the output of the ordinary leaky-bucket shaper, taking into 
account the initial level of the buffer, or, if smaller, the output imposed by the initial 
conditions, independent of the input. 
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8.4 Time Varying Leaky-Bucket Shaper 

We consider now time varying leaky-bucket shapers that are piecewise constant. The 
shaper is defined by a fixed number J of leaky buckets, whose parameters change 
at times f,. For t € := li, we have thus 

Vj (t) = r] and bj (t) = 6*- 

At times ti, where the leaky bucket parameters are changed, we keep the leaky 
bucket level qj{ti) unchanged. 

We say that <t,(u) := mini<jj{r*u + bj} is the value of the time varying 
shaping curve during interval With the notation in Section 8.2, we have 

H(t,ti) = (Tiit - ti) if f G 

We can now use the results in the previous section. 

Proposition 8.4.1 (Bucket Level). Consider a piecewise constant time varying 
leaky-bucket shaper with output R*. The bucket level qj{t) of the j-th bucket is, 
fort G f: 

qj{t)= [R*{t)-R*{ti)-ry{t-ti)-\-qj{ti)]v 

- R*{s) - rj • (t - s)} 

Proof: We use a time shift, defined as follows. Consider a fixed interval 7, and 

define 

x*{t) ■.= R*{ti-\-T)-R*{ti) 

Observe that qj{ti + r) is the backlog at time r (call it w{t) at the shaper with 
shaping curve a{r) = rj • t, fed with flow x* , and with an initial buffer level qj (ti). 
By Chapter 8.3.1 we have 

w{t) = [a;*(r) - r * • r + gj(t*)] V sup {a;*(r) - a;* (s') - r* • (r - s')} 

0<5'<r 

which after re-introducing R* gives Equation (8.7) □ 

Theorem 8.4.1 (Time Varying Leaky-Bucket Shapers). Consider a piecewise 
constant time varying leaky-bucket shaper with time varying shaping curve a , in 
the interval f. The output R* for a given input R is 



R* (t) = min 



(7°{t - ti) -b R*{ti) 



inf {(Ti{t — s) -b 7?(s)} 

ti <S<t 



( 8 . 8 ) 



with af is defined by 



(u) = r] • u -b 6} - qjif) 
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and Qjiti) is defined recursively by Equation (8.7). The backlog at time t is defined 
recursively by 



w(t) = max 



sup {R{t) — R{s) — afit — s)}, 

U <S<t 

R(t) - R(ti) ti) + w{ti) 



t G h 



(8.9) 



Proof: Use the same notation as in the proof of Proposition 8.4.1 and define in 

addition 

x(t) := R(ti +t) - R(ti) 

We can now apply Theorem 8.3.1, with initial bucket levels equal to qj{ti) as 
given in Equation (8.7) and with an initial buffer level equal to w{ti). The input- 
output characterization of this system is given by Equation (8.6), thus 

x*(t) = (T°(r) A [oi ® x'](t) 



where 



J x(t) +w(ti) 
\ x{t) 



r > 0 
r < 0 



Hence, re-introducing the original notation, we obtain 



R*{t)-R*{ti) 



— inf {(Ti{t — s) + R{s) — R{ti) + w{ti)} 

ti <S<t 



which gives Equation (8.8). 

The backlog at time t follows immediately. □ 

Note that Theorem 8.4.1 provides a representation of H. However, the represen- 
tation is recursive: in order to compute R*{t), we need to compute R*{ti) for all 
ti < t. 



8.5 Bibliographic Notes 

[63] illustrates how the formulas in Section 8.4 form the basis for defining a rene- 
gotiable VBR service. It also illustrates that, if some inconsistency exists between 
network and user sides whether leaky buckets should be reset or not at every rene- 
gotiation step, then this may result in inacceptable losses (or service degradation) 
due to policing. 

[11] analyzes the general concept of time varying shapers. 




Chapter 9 

Systems with Losses 



All chapters have dealt up to now with lossless systems. This chapter shows that 
network calculus can also be applied to lossy systems, if we model them as a lossless 
system preceded by a ‘clipper’ [15, 16], which is a controller dropping some data 
when a buffer is full, or when a delay constraint would otherwise be violated. By 
applying once again Theorem 4.3.1, we obtain a representation formula for losses. 
We use this formula to compute various bounds. The first one is a bound on the loss 
rate in an element when both an arrival curve of the incoming traffic and a minimum 
service curve of the element are known. We use it next to bound losses in a complex 
with a complex service curve (e.g., VBR shapers) by means of losses with simpler 
service curves (e.g., CBR shapers). Finally, we extend the clipper, which models 
data drops due to buffer overflow, to a ‘compensator’, which models data accrual 
to prevent buffer underflow, and use it to compute explicit solutions to Skorokhod 
reflection mapping problem with two boundaries. 



9.1 A Representation Formula for Losses 

9.1.1 Losses in a Finite Storage Element 

We consider a network element offering a service curve /3, and having a finite stor- 
age capacity (buffer) X. We denote by a the incoming traffic. 

We suppose thaf the buffer is not large enough to avoid losses for all possible 
input traffic patferns, and we would like fo compute the amount of data lost at time 
t, with the convention that the system is empty at time t = 0. We model losses as 
shown in Figure 9.1, where x{t) is the data that has actually entered the system in 
the time interval [0, fj. The amount of data lost during the same period is therefore 
L(t) = a(t) — x(t). 

The model of Figure 9. 1 replaces the original lossy element, by an equivalent 
concatenation a controller or regulator that separates the incoming flow a in two 
separate flows, x and L, and thaf we call clipper, following fhe denomination in- 
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troduced in [16], together with the original system, which is now lossless for flow 

X. 



a (t) 



Clipp er 

/ N x(t) 

— u— 

I 

L(t) 



D— ► 



Figure 9.1: System with losses 



The amount of data {x{t) — x{s)) that actually entered the system in any time 
interval {s,t] is always bounded above by the total amount of data (a{t) — a{s)) 
that has arrived in the system during the same period. Therefore, for any 0 < s < f , 
x{t) < x{s) + a{t) — a{s) or equivalently, using the linear idempotent operator 
introduced by Definition 4.1.5, 

x{t) < inf {a{t) — a{s) +a;(s)} = ha(x)(t). (9.1) 

Q<s<t 

On the other hand, x is the part of a that does actually enter the system. If y 
denotes its output, there is no loss for x if x{t) — y{t) < X for any t. We do not 
know the exact mapping y = II (a;) realized by the system, but we assume that II is 
isotone. So at any time t 



x{t) < y{t) +X = n(a;)(f) + X (9.2) 

The data x that actually enters the system is therefore the maximum solution to 
(9.1) and (9.2), which we can recast as 

X < a A {n(a;) + X } A ha(x), (9.3) 

and which is precisely the same equation as (4.33) with W = X and M = a. Its 
maximal solution is given by 

X = ({n + X} A ha)(a), 
or equivalently, after applying Corollary 4.2.1, by 

a: = ((/i„o(n+X)) o ha) (a) = ((/i„ o (n + X))) (a) (9.4) 

where the last equality follows from ha{a) = a. 

We do not know the exact mapping II, but we know that II > C^. We have thus 

that 




9.1. A REPRESENTATION FORMULA FOR LOSSES 



253 



x>{haoCp+x){a). (9.5) 

The amount of lost data in the interval [0, t] is therefore given hy 

L(t) = a(t) — x(t) 

= a{t) - hao {Cp+x}{a){t) = a{t) - inf | {ha o {a){t) 

nGN I J 

= sup |a(f) - {ha oCp+x)^"'' (a)(i)| 

= sup{a(f) — inf {a(f) — a(si) + /3(si — S 2 ) + ^ 

n>0 0<S2n<---<S2<Sl<t 

+a(s 2 ) - • • ■ + a(s 2 n)}} 

= sup{ sup {a(si) - /3(si - S 2 ) - a(s 2 ) 

nGN 0<52n <t 

+ . . . - a{s 2 n) - nX}}. 

Consequently, the loss process can he represented hy the following formula: 

L{t) < 



sup < sup 5 [g(s 2 »-l) - a{s2i) - P{s2i-1 - S2i) - X] 

nGN I 0<52n <---:<52;<5 i <t ! j — 1 

(9.6) 

If the network element is a greedy shaper, with shaping curve j3, then II (a;) = C p, 
and the inequalities in (9.5) and (9.6) become equalities. 

What the formula says is that losses up to time t are obtained by summing the 
losses over all intervals [s 2 *-i , S 2 i], where S 2 i marks the end of an overflow period, 
and where S 2 *-i is the last time before S2i when the buffer was empty. These inter- 
vals are therefore larger then the congestion intervals, and their number n is smaller 
or eqaul to the number of congestion intervals. Figure 9.2 shows an example where 
n = 2 and where there are three congestion periods. 

We will see in the next sections how the losses representation formula (9.6), can 
help us to obtain deterministic bounds on the loss process in some systems. 

9.1.2 Losses in a Bounded Delay Element 

Before moving to these applications, we first derive a representation formula for a 
similar problem, where data are discarded not because of a finite buffer limit, but 
because of a delay constraint: any entering data must have exited the system after at 
most d unit of time, otherwise it is discarded. Such discarded data are called losses 
due to a delay constraint of d time units. 

As above, let x be the part of a that does actually enter the system, and let y be 
its output. All the data x{t) that has entered the system during [0, t] must therefore 
have left at time t + d at the latest, so that x{t) — y{t + d) < 0 for any t. Thus 




x{t) < y{t + d) = U{x){t + d) = {S-d O n)(a;)(f), 



(9.7) 
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Figure 9.2: Losses in a constant rate shaper (/3 = Ac). Fresh traffic a is 
represented with a thin, solid line; accepted traffic x is represented by a 
bold, solid line; the output process y is represented by a bold, dashed line. 



where S-d is the shift operator (with forward shift of d time units) given by Defini- 
tion 4.1.7. 

On the other hand, as in the previous example, the amount of data {x{t)-x{s)) 
that actually entered the system in any time interval {s,t] is always bounded above 
by the total amount of data (a{t) — a{s)) that has arrived in the system during the 
same period. Therefore the data x that actually enters the system is therefore the 
maximum solution to 



X < a A {S-d o n)(a;) A ha{x), 



which is 

X = ({>5_d O n} A ha){a), 
or equivalently, after applying Corollary 4.2.1, by 

X = (ha o {{S-d o n}) o ha^ (a) = {ha o S-d o II) (a). 

Since II > Cp, we also have. 



x> {ha o S-d o Cp) {a). 

The amount of lost data in the interval [0, i\ is therefore given by 
L{t) < sup I a{t) - {ha o S-d o (a) 

nSN ^ 



(9.8) 



(9.9) 



(9.10) 
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which can be developed as 

m < 

sup < sup \'^[a{s 2 i-i) - a{s 2 i) - P{s 2 i -1 + d - S 2 i)] 

nGN |^0<52n<---<52<5l<t 

(9.11) 

Once again, if II = Cp, then (9.1 1) becomes an equality. 

We can also combine a delay constraint with a buffer constraint, and repeat the 
same reasoning, starting from 

X < a A {n(a;) + X} A {S-4 o II) (a;) A ha{x). (9.12) 

to obtain 

n 

L{t) < sup{ sup {y~^[g(s 2 »-i) - a{s 2 i) 

nGN 0<S2n<---<S2<Sl<t 

— {p{s2i-l + d — S2i) A {l 3 {s 2 i-l — S2i) + ^})]}}. (9.13) 

This can be recast as a recursion on time if f € N, following the time method to 
solve (9.12) instead of the space method. This recurstion is established in [15]. 




9.2 Application 1: Bound on Loss Rate 



Let us return to the case of losses due to buffer overflow, and suppose that in this 
section fresh traffic a is constrained by an arrival curve a. 

The following theorem provide a bound on the loss rate l(t) = L{t)ja{t), and 
is a direct consequence of the loss representation (9.6). 



Theorem 9.2.1 (Bound on loss rate). Consider a system with storage capacity X, 
offering a service curve (3 to a flow constrained by an arrival curve a. Then the loss 
rate l(t) = L(t) /a(t) is bounded above by 



iff) = 1 - 



inf 

0<s<t 



fl{s)+X 

a(s) 



(9.14) 



Proof: With Iff) defined by (9.14), we have that for any D <u <v <t, 

!_((()= i„f 

o<s<i a(s) aff — u) aff) — a(u) 

because a{v) — aff) < aff — u) by definition of an arrival curve. Therefore, for 
any 0 < u < v < t, 

aff) — aff) — (3{v — u) — X < Iff) ■ [a(u) — aff)]. 
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For any n € No = {1,2,3, ...}, and any sequence |si;}i<i;< 2 n, with 0 < S 2 n < 
. . . < Si <t, setting v = S 2 i-i and u = S 2 i in the previous equation, and summing 
over i, we obtain 

n n 

[a(s2i-i) - a(s2i) - I 3 (s 2 i -1 - S2i) - ^] < i(t) * ^ [a(s2i-i) - a(s2i)] • 

Because the Sk are increasing with k, the right hand side of this inequality is always 
less than, or equal to, l{t) ■ a{t). Therefore we have 



L(t) < sup < sup \'^[a{s 2 i-i) - a{s 2 i) - P{s 2 i-i - S 2 i) - X] 

KGN {0<S2n<...<Sl<i 

< i(t) • a(t), 

which shows that l{t) > l(t) = L{t)la{t). □ 

To have a bound independent of time t, we take the sup over all t of (9.14), to 
get 

i = supi(t) = 1 — inf (9.15) 

i>o t>o a(t) 

and retrieve the result of Chuang and Chang [14]. 

A similar result for losses due to delay constraint d, instead of finite buffer X, 
can be easily obtained, too: 




i{t) 

i 



1 - inf 

0<s<t 



P{s + d) 
a(s) 



1 - inf 

i>0 



P{t + d) 
a(t) 



(9.16) 

(9.17) 



9.3 Application 2: Bound on Losses in Complex Sys- 
tems 

As a particular application of the loss representation formula (9.6), we show how 
it is possible to bound the losses in a system offering a somewhat complex service 
curve j3, by losses in simpler systems. The first application is the bound on the losses 
in a shaper by a system that segregates the resources (buffer, bandwidth) between 
a storage system and a policer. The second application deals with a VBR shaper, 
which is compared with two CBR shapers. For both applications, the losses in the 
original system are bounded along every sample path by the losses in the simpler 
systems. For congestion times however, the same conclusion does not always hold. 



9.3.1 Bound on Losses by Segregation between Buffer and Po- 
licer 

We will first compare the losses in two systems, having the same input flow a{t). 
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The first system is the one of Figure 9.1 with service curve /3 and buffer X, 
whose losses L{t) are therefore given by (9.6). 

The second system is made of two parts, as shown in Figure 9.3(a). The first 
part is a system with storage capacity X, that realizes some mapping II ' of the input 
that is not explicitly given, but that is assumed to be isotone, and not smaller than 
n (n' > n). We also know that a first clipper discards data as soon as the total 
backlogged data in this system exceeds X. This operation is called buffer discard. 
The amount of buffer discarded data in [0,f] is denoted by I/Buf(i)- The second 
part is a policer without buffer, whose output is the min-plus convolution of the 
accepted input traffic by the policer by /). A second clipper discards data as soon as 
the total output flow of the storage system exceeds the maximum input allowed by 
the policer. This operation is called policing discard. The amount of discarded data 
by policing in [0, t] is denoted by Lpoi(f). 



Buffer Policer 




Lsufld 



Lpoi(t) 



(a) 



Buffer Virtual segregated system 




Figure 9.3; A storage/policer system with separation between losses due to 
buffer discard and to policing discard (a) A virtual segregated system for 
2 classes of traffic, with buffer discard and policing discard, as used by Lo 
Presti et al [50] (b) 



Theorem 9.3.1. Let L{t) be the amount of lost data in the original system, with 
service curve (3 and buffer X. 
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Let I/Buf (i) ( resp. I/poi(i ) ) be the amount of data lost in the time interval [0, t] 
by buffer ( resp. policing) discard, as defined above. 

Then L(t) < I/Buf(i) + ipoi(i)- 

Proof: Let x and y denote respectively the admitted and output flows of the 

buffered part of the second system. Then the policer implies that y = ^ x, and 

any time s we have 



a{s) - LBuf(s) - X = x{s) - X < y{s) < x{s) = a{s) - I/Buf(s)- 
which implies that for any 0 < u < v < t, 
y{v) - y{u) - fi{v - u) 

> (a(v) - Lbu((v) - X) - (a(u) - Lbu((u)) - (3{v - u) 

= a(v) - a(u) - (3{v - u) - X - (I/Buf(t') - -C'Buf(M))- 

We use the same reasoning as in the proof of Theorem 9.2.1; we pick any n € No 
and any increasing sequence {sk}i<,k< 2 n, with 0 < S 2 n < . . . < si < t. Then we 
set V = S 2 i-i and u = S 2 i in the previous inequality, and we sum over i, to obtain 

n 

^ [y{s 2 i-l) - y{s 2 i) - !^{s 2 i-l - S 2 i)] > 

n 

[a(s2i-l) - (l{s2i) - ( 3 {s 2 i-l - S2i) - X] 

n 

- ^ [(-C'Buf(s2*-l) - LBuf(s2*))] • 

By taking the supremum over all n and all sequences {s k}i<k< 2 n, the left hand side 
is equal to I/poi(f), because of (9.6) (we can replace the inequality in (9.6) by an 
equality, because the output of the policer is y = fi^x). Since {s is a wide-sense 
increasing sequence, and since Lbu( is a wide-sense increasing function, we obtain 
therefore 

Lpoi(t) > 

sup < sup [a{s 2 i-l) - a{s 2 i) - P{s 2 i -1 - S 2 i) - x]\ - LBuf{t) 

nSN \^0<S2n'^-'<Sl<t ) 

= L{t) - LBu({t), 

which completes the proof. □ 

Such a separation of resources between the “buffered system” and “policing 
system” is used in the estimation of loss probability for devising statistical CAC 
(Call Acceptance Control) algorithms as proposed by Elwalid et al [23], Lo Presti 
et al. [50]. The incoming trafflc is separated in two classes. All variables relating 
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to the first (resp. second) class are marked with an index 1 (resp. 2), so that a{t) = 
ai (t) + 02 (t) . The original system is a CBR shaper (/3 = Ac) and the storage system 
is a virtually segregated system as in Figure 9.3(b), made of 2 shapers with rates C j 
and Cl and buffers X | and Al| . The virtual shapers are large enough to ensure that 
no loss occurs for all possible arrival functions a\(t) and 02 (f). The total buffer 
space (resp. bandwidth) is larger than the original buffer space (resp. bandwidth): 
+ Al| > X (C| + C| > C). However, the buffer controller discards data as 
soon as the total backlogged data in the virtual system exceeds X and the policer 
controller discards data as soon as the total output rate of the virtual system exceeds 
C. 



9.3.2 Bound on Losses in a VBR Shaper 

In this second example, we consider of a “buffered leaky bucket” shaper [44] with 
buffer X, whose output must conform to a VBR shaping curve with peak rate P, 
sustainable rate M and burst tolerance B so that here the mapping of the element is 
B = Cj} with = Xp A 'Ym,b- We will consider two systems to bound these losses: 
first two CBR shapers in parallel (Figure 9.4(a)) and second two CBR shapers in 
tandem (Figure 9.4(b)). Similar results also holds for losses due to a delay constraint 
[47]. 



a(t) 



Clipper 

I 

L'CBR’(t) 



-► X [ ?)- 



a(t) 



Clipper 



O 



X+B 






(a) 



Clipper Clipper 




(b) 



Figure 9.4: Two CBR shapers in parallel (a) and in tandem (b). 
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We will first show that the amount of losses during [0,f] in this system is 
bounded by the sum of losses in two CBR shapers in parallel, as shown in Fig- 
ure 9.4(a): the first one has buffer of size X and rate P, whereas the second one has 
buffer of size X + B and rate M. Both receive the same arriving traffic a as the 
original VBR shaper. 

Theorem 9.3.2. Let Lvugf) be the amount of lost data in the time interval [0, f\ in a 
VBR shaper with buffer X and shaping curve (3 = Xp A when the data that 

has arrived in [0, t] is a(t). 

Let LcBR'it) (resp. LcBR"{f)) be the amount of lost data during [0,f] in a CBR 
shaper with buffer X (resp. {X + B)) and shaping curve Xp (resp. Xm) with the 
same incoming traffic a(f). 

Then LyBuif) ^ LcBR’if) P LcBR"(tf 

Proof: The proof is again a direct application of (9.6). Pick any 0 < u < v < t. 

Since (} = Xp A ^m,b, 

a(v) — a(u) — l3(v — u) — X = 

{a(t;) — a(u) — P(v — u) — X} V {a(?t) — a(u) — M(v — u) — B — X} 

Pick any n € No and any increasing sequence {s k}i<k< 2 n, with 0 < S 2 n < • • • < 
Si < t. Set V = S 2 i-i and u = S 2 i in the previous equation, and sum over i, to 
obtain 



n 

[a(s2i-i) - a(s2i) - / 3 (s 2 i-i - S2i) - X] 

n 

= - a(s2i) - P{s2i-1 - S2i) - X) 

V{a(s 2 *-i) - a(s2i) - M(s2i-i - S2i) - B - X} 

n 

< XI - a{s2i) - P{s2i-1 - S2i) - X] 

n 

+ X - (l{s 2 i) - M{s 2 i-l - S 2 i) - B - X] 

T/cbr' (f ) T Lqbr" (f ) : 

because of (9.6). By taking the supremum over all n and all sequences {s k}i<k< 2 n 
in the previous inequality, we get the desired result. □ 

A similar exercise shows that the amount of losses during [0,f] in the VBR 
system is also bounded above by the sum of losses in two CBR shapers in cascade 
as shown in Figure 9.4(b): the first one has buffer of size X and rate P, and receives 
the same arriving traffic a as the original VBR shaper, whereas its output is fed into 
the second one with buffer of size B and rate M. 
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Theorem 9.3.3. Let LysRit) be the amount of lost data in the time interval [0, t] in a 
VBR shaper with buffer X and shaping curve (3 = Xp A 7m, s. when the data that 
has arrived in [0, t] is a(t). 

Let LcBR'if) (resp. LcBR"{f)) be the amount of lost data during [0,i] in a CBR 
shaper with buffer X ( resp. B ) and shaping curve Xp ( resp. Am ) with the same 
incoming traffic a{t) (resp. the output traffic of the first CBR shaper). 

Then Lvbr (t) < L/cBR' (t) + Lcbr" (t). 

The proof is left as an exercise. 

Neither of the two systems in Figure 9.4 gives the better bound for any arbitrary 
traffic pattern. For example, suppose that the VBR system parameters are P = A, 
M = 1, B = 12 and X = 4, and that the traffic is a single burst of data sent at rate 
R during four time units, so that 

, . _ J i? • f if 0 < f < 4 
\ 4i? if t>4 

If i? = 5, both the VBR system and the parallel set of the two CBR' and CBR" 
systems are lossless, whereas the amount of lost data after five units of time in the 
tandem of the two CBR' and CBR'" systems is equal to three. 

On the other hand, if i? = 6, the amount of lost data after five units of time in the 
VBR system, the parallel system (CBR' and CBR") and the tandem system (CBR' 
and CBR'") are respectively equal to four, eight and seven. 

Interestingly enough, whereas both systems of Figure 9.4 will bound the amount 
of losses in the original system, it is no longer so for the congestion periods, i.e. the 
time intervals during which losses occur. The tandem system does not offer a bound 
on the congestion periods, contrary to the parallel system [47]. 



9.4 Solution to Skohorkhod’s Reflection Problem with 
Two Boundaries 

To obtain the model of Figure 9.1, we have added a regulator - called clipper - 
before the system itself, whose input x is the maximal input ensuring a lossless 
service, given a finite storage capacity X. The clipper eliminates the fraction of 
fresh traffic a that exceeds x. We now generalize this model by adding a second 
regulator after the lossless system, whose output is denoted with y, as shown on 
Figure 9.5. This regulator complements y, so that the output of the full process is 
now a given function b G (F. The resulting process N = y — b is the amount of 
traffic that needs to be fed to prevent the storage system to enter in starvation. N 
compensates for possible buffer underflows, hence we name this second regulator 
compensator. 

We can explicitly compute the loss process L and the “compensation” process 
N, from the arrival process a and the departure process b, using, once again, Theo- 
rem 4.3.1. We are looking for the maximal solution 
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Clipper Compensator 




I I 



L(t) N(t) 



Figure 9.5; A storage system representing the variables used to solve Sko- 
rokhod’s reflection problem with two boundaries 



x{t) = [x{t) 

where T denotes transposition, to the set of inequalities 



x{t) 


0<s<t 


(9.18) 


x{t) 


< y{t)+x 


(9.19) 


y{t) 


< x(t) 


(9.20) 


yit) 


0<s<t 


(9.21) 


The two first inequalities are identical to (9.1) and to (9.2). The two last inequalities 
are the dual constraints on y. We can therefore recast this system as 


X 


< a A ha{x) A {y + X} 


(9.22) 


y 


< bAxAhbix). 


(9.23) 



This is a system of min-plus linear inequalities, whose solution is 

X = Ch{o) = jC.jj(a), 

where H and a are defined as 

a{t) = [a{t) b{tr 



H{t,s) 



a(t) — a(s) So(t — s) + X 
So{t - s) b(t) - b{s) 



for all 0 < s < f. Instead of computing H, we go faster by first computing the 
maximal solution of (9.23). Using properties of the linear idempotent operator, we 
get 

y = hi(x A b) = hi(x A b) = hi(x) A hi(b) = hi(x). 

Next we replace y by hb{x) in (9.22), and we compute its maximal solution, which 
is 

X = ha A {hf, + X}{a). 
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We work out the sub-additive closure using Corollary 4.2.1, and we obtain 



X = (haO {hb -|-X})(a) 



and thus 



y= (hbohao {hb -b (a). 
After some manipulations, we get 



(9.24) 

(9.25) 



N{t) = b{t) - y{t) = 



sup < sup <. 

nGN I 0<52n + l<---<52<5l<t I 



r2n+l ^ 

' ^ (-l)*(a(s*) - h{si)) > - nX 



(9.26) 



L{t) = a{t) — x{t) = 



2n 



sup 



sup 



nSN |^0<52n V, « — 1 



^(-l)'+i(a(s,) - h{si)) \-nX\. (9.27) 



Interestingly enough, these two functions are the solution of the so-called Sko- 
rokhod reflection problem with two fixed boundaries [66, 33]. 

Let us describe this reflection mapping problem following the exposition of [40]. 
We are given a lower boundary that will be taken here as the origin, an upper bound- 
ary A > 0, and a. free process z{t) € IR such that 0 < 2:(0— ) < X. Skorokhod’s 
reflection problem looks for functions N{t) {lower boundary process) and L{t) (up- 
per boundary process) such that 



1 . The reflected process 



W{t) = z{f) + N{f) - L{f) (9.28) 

is in [0, X] for all f > 0. 

2. Both N{f) and L{f) are non decreasing with A(0— ) = L(0— ) = 0, and N{f) 
(resp. L{f)) increases only when W{f) =0 (resp. W{f) = X), i.e., with 1^ 
denoting the indicator function of A 




^{w(t)>o}dN{t) 



^{w(t)<x}dL{t) 



0 

0 



(9.29) 

(9.30) 



The solution to this problem exists and is unique [33]. When only one boundary 
is present, explicit formulas are available. For instance, if A oo, then there is 
only one lower boundary, and the solution is easily found to be 
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If X < 00 , then the solution can be constructed by successive approximations but, 
to our knowledge, no solution has been explicitly obtained. The following theorem 
gives such explicit solutions for a continuous VF function z{t). A VF function (VF 
standing for Variation Finie [33, 62]) z{t) on K+ is a function such that for all f > 0 

sup sup 

nGNo 0=s„<s„_i<...<si<so=i 

VF functions have the following property [62] : 2 : (t) is a VF function on K + if and 
only if it can be written as the difference of two wide-sense increasing functions on 

K+. 



n— 1 



^ \z{Si) - z{Si+i)\ ^ < 00 . 



i=0 



Theorem 9.4.1 (Skorokhod’s reflection mapping). Let the free process z(f) be a 
continuous VF function on K“*" . Then the solution to Skorokhod’s reflection problem 
on I 



X] is 




II 

CO 


f 

sup 


nSN 1 


[^0<52n + l <Sl <t 


II 

CO 


\ / 
sup { 


nSN 1 


1 0<52n <...<52<5 i <t ! 



'2n+l 



I ^(-l)'2(si)| -nXj (9.31) 
^(-l)'+i2(si)| -nxj. (9.32) 



Proof: As z{t) is a VF function on [0, 00 ), there exist two increasing functions 

a{t) and b{t) such that z{t) = a{t) — b{t) for all f > 0. As 2 ( 0 ) > 0, we can take 
6(0) = 0 and a(0) = 2 ( 0 ). Note that a,b £ X. 

We will show now that L = a — x and N = b — y, where x and y are the maximal 
solutions of (9.22) and (9.23), are the solutions of Skorokhod’s reflection problem. 
First note that 

W(t) = z(t)+N(t)-L(t) = (a(t)-b(t)) + (b(t)-y(t))-(a(t)-x(t)) = x(t)-y(t) 

is in [0, X] for alH > 0 because of (9.19) and (9.20). 

Second, because of (9.21), note that N{0) = 6(0) — y{0) = 0 and that for any 
t > 0 and 0 < s < t, N{t) — N{s) = b(f) — b{s) + y{s) — y(f) > 0, which 
shows that N(f) is non decreasing. The same properties can be deduced for L{t) 
from (9.18). 

Finally, if W (t) = x{t) — y{t) > 0, there is some s* € [0, t] such that y{t) = 
y{s*) + b{t) — b{s*) because y is the maximal solution satisfying (9.20) and (9.21). 
Therefore for all s € [s*, f], 

0 < N{t) - N{s) < N{t) - N{s*) = b{t) - b{s*) + y(s*) - y(t) = 0 

which shows that N(t) — N{s) =0 and so that N{t) is non increasing if W{t) > 0. 
A similar reasoning shows that L{t) is non increasing if W{t) < X. 

Consequently, N{t) and L{t) are the lower and upper reflected processes that 
we are looking for. We have already computed them: they are given by (9.26) and 
(9.27). Replacing a(s,) — 6(s,) in these two expressions by z(si), we establish (9.31) 
and (9.32). □ 
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9.5 Bibliographic Notes 

The clipper was introduced by Cruz and Tenaja, and was extended to get the loss 
representation formula presented in this chapter in [15, 47]. Explicit expressions 
when operator II is a general, time-varying operator, can be found in [15]. We expect 
results of this chapter to form a starting point for obtaining bounds on probabilities 
of loss or congestion for lossy shapers with complex shaping functions; the method 
would consist in applying known bounds to virtual systems and take the minimum 
over a set of virtual systems. 
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Ca (Vector min-plus convolution), 162 
Cct (Min-plus convolution), 162 
Vfj (Min-plus deconvolution), 162 
Vl (Packetization), 162 
Ch (Min-plus linear operator), 166 
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No, 256 

n (Max-plus operator), 161 
n (Min-plus operator), 161 
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(rate-latency function), 128 
T (set of wide-sense increasing func- 
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(5t (burst delay function), 128 
7r,() (affine function), 128 
h (horizontal deviation), 154 
hfj (Linear idempotent operator), 162 
\r (peak rate function), 128 
0 (min-plus deconvolution), 147 
0 (min-plus deconvolution), 155 
0 (min-plus convolution), 133 
0 (max-plus convolution), 155 
/ (sub-additive closure of /), 143 
T (Set of wide-sense increasing bi- 
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ut,t (staircase function), 129 
vt (step function), 129 

V (vertical deviation), 154 
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adaptive guarantee, 230 
affine function, 128 
arrival curve, 8 
Available Bit Rate, 243 

bivariate function, 161 
burst delay function, 128 
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CDVT (cell delay variation tolerance), 
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concave function, 132 
controlled load service, 89 
convex function, 132 
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Cumulative Packet Length, 50 
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damping tolerance, 117 
Delay Based Scheduler, 95 
DGCRA, 243 
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Earliest Deadline First (EDF) sched- 
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EDF see Earliest Deadline First, 95 
epigraph, 133 

Finite lifetime, 151 
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greedy source, 19 
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Guaranteed Rate scheduling, 87 
guaranteed service, 89 

horizontal deviation, 154 

idempotent, 171 
impulse response, 166, 169 
infimum, 125 
Intserv, 3 
isotone, 164 

limit to the left, 1 1 
limit to the right, 7 
linear idempotent operator, 162 
look ahead delay, 185 
lower semi-continuous, 164 

max-plus convolution, 155 
max-plus deconvolution, 155 
maximum, 155 
min-plus convolution, 133 
min-plus deconvolution, 147 
Min-plus linear, 165, 168 
minimum, 125 
minimum rateserver, 235 

Packet Scale Rate Guarantee, 234 
Packetizer, 5 1 
peak rate function, 128 
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playback buffer, 185 
playback delay, 185 
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Priority Node, 25, 208 
pseudo-inverse function, 129 

rate-latency function, 128 
Re-negotiable service, 243 
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shaper, 37 
shaping curve, 37 
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Shift matrix, 165 
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smoothing, 185 
staircase function, 129 
star-shaped function, 133 
step function, 129 
strict service curve, 27 
sub-additive closure, 143 
sub-additive fucntion, 140 
supremum, 155 

T-SPEC (traffic specification), 17 
time varying shaper, 244 

upper semi-continuous, 163 
utilization factor, 106 

variable capacity node, 27 
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Very good function, 20 
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